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ABSTRACT 

We  measure  the  large-scale  real-space  power  spectrum  P(k)  by  using  a  sample  of  205,443  galaxies  from  the 
Sloan  Digital  Sky  Survey,  covering  2417  effective  square  degrees  with  mean  redshift  0.1.  We  employ  a 
matrix-based  method  using  pseudo-Karhunen-Loeve  eigenmodes,  producing  uncorrelated  minimum-variance 
measurements  in  22  /(-bands  of  both  the  clustering  power  and  its  anisotropy  due  to  redshift-space  distortions, 
with  narrow  and  well-behaved  window  functions  in  the  range  0.02  h  Mpc~'  <  k  <  0.3  h  Mpc-1.  We  pay  par¬ 
ticular  attention  to  modeling,  quantifying,  and  correcting  for  potential  systematic  errors,  nonlinear  redshift 
distortions,  and  the  artificial  red-tilt  caused  by  luminosity-dependent  bias.  Our  results  are  robust  to  omitting 
angular  and  radial  density  fluctuations  and  are  consistent  between  different  parts  of  the  sky.  Our  final  result  is  a 
measurement  of  the  real-space  matter  power  spectrum  P(k)  up  to  an  unknown  overall  multiplicative  bias  factor. 
Our  calculations  suggest  that  this  bias  factor  is  independent  of  scale  to  better  than  a  few  percent  for 
k  <  0.1  h  Mpc  1 ,  thereby  making  our  results  useful  for  precision  measurements  of  cosmological  parameters  in 
conjunction  with  data  from  other  experiments  such  as  the  Wilkinson  Microwave  Anisotropy  Probe  satellite.  The 
power  spectrum  is  not  well-characterized  by  a  single  power  law  but  unambiguously  shows  curvature.  As  a  simple 
characterization  of  the  data,  our  measurements  are  well  fitted  by  a  flat  scale-invariant  adiabatic  cosmological 
model  with  h  =  0.213  ±  0.023  and  (78  =  0.89  ±  0.02  for  T*  galaxies,  when  fixing  the  baryon  fraction 
Qb/flm  =  0.17  and  the  Hubble  parameter  h  =  0.72;  cosmological  interpretation  is  given  in  a  companion  paper. 

Subject  headings:  galaxies:  statistics  —  large-scale  structure  of  universe  —  methods:  data  analysis 
On-line  material:  color  figures 


1.  INTRODUCTION 

The  spectacular  recent  cosmic  microwave  background 
(CMB)  measurements  from  the  Wilkinson  Microwave  Anisot¬ 
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ropy  Probe  (WMAP)  satellite  (Bennett  et  al.  2003)  and  other 
experiments  have  increased  the  importance  of  non-CMB 
measurements  for  the  endeavor  to  constrain  cosmological 
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models  and  their  free  parameters.  These  non-CMB  con¬ 
straints  are  crucially  needed  for  breaking  CMB  degeneracies 
(Eisenstein  et  al.  1999;  Efstathiou  &  Bond  1999;  Bridle  et  al. 
2003);  for  instance,  WMAP  alone  is  consistent  with  a  closed 
universe  with  Hubble  parameter  h  =  0.32  and  no  cosmological 
constant  (Spergel  et  al.  2003;  Verde  et  al.  2003).  Yet  they  are 
currently  less  reliable  and  precise  than  the  CMB,  making  them 
the  limiting  factor  and  weakest  link  in  the  quest  for  precision 
cosmology.  Much  of  the  near-term  progress  in  cosmology  will 
therefore  be  driven  by  reductions  in  statistical  and  systematic 
uncertainties  of  non-CMB  probes  such  as  Lya  forest  and  gal¬ 
axy  clustering  and  motions,  gravitational  lensing,  cluster 
studies,  and  supernova  la  distance  determinations.  Galaxy 
redshift  surveys  can  play  a  key  role  in  breaking  degeneracies 
and  providing  cross-checks  (Tegmark  1997a;  Goldberg  & 
Strauss  1998;  Wang,  Spergel,  &  Strauss  1999;  Eisenstein  et  al. 
1999)  but  only  if  systematics  can  be  controlled  to  high  preci¬ 
sion.  The  goal  of  the  present  paper  is  to  do  just  this,  using  over 
200,000  galaxies  from  the  Sloan  Digital  Sky  Survey  (SDSS; 
York  et  al.  2000)  to  measure  the  shape  of  the  real-space  matter 
power  spectrum  P(k),  accurately  quantifying  and  correcting  for 
the  effects  of  light-to-mass  bias,  redshift-space  distortions, 
survey  geometry  effects,  and  other  complications. 

The  cosmological  constraining  power  of  three-dimensional 
maps  of  the  universe  provided  by  galaxy  redshift  surveys  has 
motivated  ever  more  ambitious  observational  efforts  such  as 
the  CfA/ Updated  Zwicky  Catalog  (UZC;  Huchra  et  al.  1990; 
Falco  et  al.  1999),  Las  Campanas  Redshift  Survey  (Shectman 
et  al.  1996),  and  PSCz  (Saunders  et  al.  2000)  surveys,  each 
well  in  excess  of  104  galaxies.  The  current  state  of  the  art  is 
the  Anglo-Australian  Telescope  Two  Degree  Field  Galaxy 
Redshift  Survey  (2dFGRS;  Colless  et  al.  2001;  Hawkins  et  al. 
2003;  Peacock  2003  and  references  therein).  Analysis  of  the 
first  147,000  2dFGRS  galaxies  (Peacock  et  al.  2001;  Percival 
et  al.  2001,  2002;  Norberg  et  al.  2001,  2002;  Madgwick  et  al. 
2002)  has  supported  a  flat,  dark  energy-dominated  cosmol¬ 
ogy,  as  have  angular  clustering  analyses  of  the  parent  catalogs 
underlying  the  2dFGRS  (Efstathiou  &  Moody  2001)  and 
SDSS  (Scranton  et  al.  2002;  Connolly  et  al.  2002;  Tegmark 
et  al.  2002b;  Szalay  et  al.  2003;  Dodelson  et  al.  2002).  Tan¬ 
talizing  evidence  for  baryonic  wiggles  in  the  galaxy  power 
spectrum  is  presented  by  Percival  et  al.  (2001),  Miller,  Nichol, 
&  Batuski  (2001a,  2001b),  and  Miller,  Nichol,  &  Chen 
(2002),  and  cosmological  models  have  been  further  con¬ 
strained  in  conjunction  with  CMB  data  (e.g.,  Spergel  et  al. 
2003;  Verde  et  al.  2003;  Lahav  et  al.  2002). 

The  SDSS  is  the  most  ambitious  galaxy  redshift  survey  to 
date,  whose  goal,  driven  by  large-scale  structure  science,  is  to 
measure  of  order  106  galaxy  redshifts.  Zehavi  et  al.  (2002) 
computed  the  correlation  function  using  about  30,000  galaxies 
from  early  SDSS  data  (Stoughton  et  al.  2002).  In  conjunction 
with  the  first  major  SDSS  data  release  in  2003  (hereafter  DR1; 
Abazajian  et  al.  2003),  a  series  of  papers  will  address  various 
aspects  of  the  three-dimensional  clustering  of  a  much  larger 
data  set  involving  over  200,000  galaxies  with  redshifts.  This 
paper  is  focused  on  measuring  the  power  galaxy  spectrum 
P(k)  on  large  scales,  dealing  with  complications  such  as 
luminosity-dependent  bias  and  redshift  distortions  only  to  the 
extent  necessary  to  recover  an  undistorted  measurement  of  the 
real-space  matter  power  spectrum.  Zehavi  et  al.  (2003)  mea¬ 
sure  and  model  the  real-space  correlation  function,  mainly  on 
smaller  scales,  focusing  on  departures  from  power-law  be¬ 
havior,  and  1.  Zehavi  et  al.  (2004,  in  preparation)  will  study 
how  the  correlation  function  depends  on  galaxy  properties. 


Pope  et  al.  (2004)  measure  the  parameters  that  characterize  the 
large-scale  power  spectrum  with  a  complementary  approach 
involving  direct  likelihood  analysis  on  Karhunen-Loeve  eigen- 
modes,  as  opposed  to  the  quadratic  estimator  technique  em¬ 
ployed  in  the  present  paper. 

This  paper  is  organized  as  follows.  In  §  2,  we  describe  the 
SDSS  data  used  and  how  we  model  it;  the  technical  details  can 
be  found  in  Appendix  A.  In  §  3,  we  describe  our  methodology 
and  present  our  basic  measurements  of  both  the  power  spec¬ 
trum  and  its  redshift-space  anisotropy.  The  details  of  the  for¬ 
malism  for  doing  this  are  described  in  Appendix  B.  In  §  4  we 
focus  on  this  anisotropy  to  model,  quantify,  and  correct  for  the 
effects  of  redshift-space  distortions,  producing  an  estimate  of 
the  real-space  galaxy  power  spectrum  and  testing  our  proce¬ 
dure  with  Monte  Carlo  simulations.  In  §  5,  we  model,  quan¬ 
tify,  and  correct  for  the  effects  of  luminosity-dependent 
biasing,  producing  an  estimate  of  the  real-space  matter  power 
spectrum.  In  §  6,  we  test  for  a  variety  of  systematic  errors.  In 
§  7,  we  discuss  our  results.  The  cosmological  interpretation  of 
our  measurements  is  given  in  a  companion  paper  (Tegmark 
et  al.  2003,  hereafter  Paper  II). 

2.  DATA  AND  DATA  MODELING 

The  SDSS  uses  a  mosaic  CCD  camera  (Gunn  et  al.  1998)  to 
image  the  sky  in  five  photometric  bandpasses  denoted  u,  g,  r, 
/',  and  z  (Fukugita  et  al.  1996). 24  After  astrometric  calibration 
(Pier  et  al.  2003),  photometric  data  reduction  (R.  Lupton  et  al. 
2004,  in  preparation;  see  Lupton  et  al.  2001  and  Stoughton 
et  al.  2002  for  summaries),  and  photometric  calibration  (Hogg 
et  al.  2001;  Smith  et  al.  2002),  galaxies  are  selected  for 
spectroscopic  observations  by  using  the  algorithm  described 
by  Strauss  et  al.  (2002).  To  a  good  approximation,  the  main 
galaxy  sample  consists  of  all  galaxies  with  /  -band  apparent 
Petrosian  magnitude  r  <17.77  (see  Appendix  A).  Galaxy 
spectra  are  also  measured  for  a  luminous  red  galaxy  sample 
(Eisenstein  et  al.  2001),  for  which  clustering  results  will  be 
reported  in  a  separate  paper.  These  targets  are  assigned  to 
spectroscopic  plates  by  an  adaptive  tiling  algorithm  (Blanton 
et  al.  2003b)  and  observed  with  a  pair  of  fiber-fed  CCD 
spectrographs  (A.  Uomoto  et  al.  2004,  in  preparation),  after 
which  the  spectroscopic  data  reduction  and  redshift  determi¬ 
nation  are  performed  by  automated  pipelines  (D.  Schlegel  et  al. 
2004,  in  preparation;  J.  Frieman  et  al.  2004,  in  preparation). 
The  rms  galaxy  redshift  errors  are  ^30  km  s_1  and  hence 
negligible  for  the  purpose  of  the  present  paper. 

Our  analysis  is  based  on  SDSS  sample  11  (Blanton  et  al. 
2003d),  consisting  of  the  205,443  galaxies  observed  before 
2002  July,  all  of  which  will  be  included  in  the  upcoming 
SDSS  Data  Release  2.  From  this  basic  sample,  we  produce  a 
set  of  subsamples  as  specified  in  Table  1 .  The  details  of  how 
this  basic  sample  was  processed,  modeled,  and  subdivided  are 
given  in  Appendix  A.  The  bottom  line  is  that  each  sample  is 
completely  specified  by  three  entities:  (1)  the  galaxy  positions 
(a  list  of  right  ascension,  declination,  and  comoving  redshift- 
space  distance  r  for  each  galaxy),  (2)  the  radial  selection 
function  h[r),  which  gives  the  expected  (not  observed)  num¬ 
ber  density  of  galaxies  as  a  function  of  distance,  and  (3)  the 
angular  selection  function  n(r),  which  gives  the  complete¬ 
ness  as  a  function  of  direction  in  the  sky.  Our  samples  are 

24  The  Fukugita  et  al.  (1996)  paper  actually  defines  a  slightly  different 
system,  denoted  u',  g' ,  r',  i' ,  r',  but  SDSS  magnitudes  are  now  referred  to  the 
native  filter  system  of  the  2.5  m  survey  telescope,  for  which  the  bandpass 
notation  is  unprimed. 
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TABLE  1 

Summary  of  Galaxy  Samples 


Area 

Sample  Name  Absolute  Magnitude  Apparent  Magnitude  Redshift  (deg2)  Galaxies 


All .  All  r  <  17.5—17.77  All  2417  205,443 

safeO .  All  14.50  <  r  <  17.50  All  2417  157,389 

safe  13 .  -23<M01,<— 18.5  14.50  <  r  <  17.50  0.001  <  z  <  0.400  2417  146,633 

safe22 .  -22  <M0Ar<  -19.0  14.50  <  r  <  17.50  0.001  <  z  <  0.400  2417  134,674 

Baseline .  -23  <  Mo  lr  <  - 18.5  14.50  <  r  <  17.50  0.002  <  z  <  0.210  2417  143,314 


Angular  Subsamples 

A 1  (south) .  —23  <  M0  i,.  <  — 18.5  14.50  <  r  <  17.50  0.017  <  z  <  0.210  600  35,782 

A2  (north) .  -23  <  M01,  <  - 18.5  14.50  <  r  <  17.50  0.017  <  z  <  0.210  1817  107,532 

A3  (north  equatorial) .  -23  <  M0Ar  <  - 18.5  14.50  <  r  <  17.50  0.017  <  z  <  0.210  809  52,081 

A4  (north  rest) .  -23  <M0  lr  < -18.5  14.50  <  r  <  17.50  0.017  <  z  <  0.210  1007  55,451 


Radial  Subsamples 

R1  (near) .  -23  <  M01,  <  - 18.5  14.50  <  r  <  17.50  0.017  <  z  <  0.078  2417  47,954 

R2  (middle) .  -23  <  M01,  <  - 18.5  14.50  <  r  <  17.50  0.078  <  z  <  0.1 17  2417  47,089 

R3  (far) .  -23  <  M0.i,  <  - 18.5  14.50  <  r  <  17.50  0.1 17  <  z  <  0.210  2417  48,271 


Luminosity  (Volume-limited)  Subsamples 

LI .  — 17  <  M0  n  <  — 16  14.50<r<  17.50  0.008  <  z  <  0.017  2417  455 

L2 .  — 18  <  Molr  <  — 17  14.50<r<  17.50  0.01 1  <  z  <  0.027  2417  1,736 

L3 .  — 19  <  Moir  <  — 18  14.50<r<  17.50  0.017  <  z  <  0.042  2417  5,191 

L4 .  —20  <  M0\r  <  — 19  14.50  <r<  17.50  0.027  <  z  <  0.065  2417  14,356 

L5 .  — 21  <  M0\r  <  —20  14.50<r<  17.50  0.042  <  z  <  0.103  2417  31,026 

L6 .  -22<M01,<-21  14.50<r<  17.50  0.065  <  z  <  0.157  2417  24,489 

L7 .  —23  <  Mo  lr  <  —22  14.50  <  r  <  17.50  0.104  <  z  <  0.238  2417  3,594 

L8 .  —24  <MQAr  <  —23  14.50  <  r  <  17.50  0.164  <  z  <  0.349  2417  95 


Mock  Samples 

M1-M275  (PTHalos) .  ...  ...  0.015  <  z  <  0.240  1395  108,300 

Vl-V  10  (VIRGO) .  ...  ...  0.001  <z<0.150  1139  103,400 


Note. — The  table  summarizes  the  various  galaxy  samples  used  in  our  analysis,  listing  cuts  made  on  evolution-corrected  absolute  magnitude 
MoAr  (for  h  =  1),  apparent  magnitude  r,  and  redshift  z.  A/o.  i r  was  computed  from  r  and  z  assuming  a  flat  cosmological  model  with  0.\  —  0.7. 


constructed  so  that  their  three-dimensional  selection  function 
is  separable,  i.e.,  simply  the  product  h(r)  =  h(r)n{r)  of  an 
angular  and  a  radial  part;  here  r  =  |r|  and  r  =  r/r  are  the 
comoving  radial  distance  and  the  unit  vector  corresponding  to 
the  position  r.  The  conversion  from  redshift  z  to  comoving 
distance  was  made  for  a  flat  cosmological  model  with  a  cos¬ 
mological  constant  Oa  =  0.7;  below  we  see  that  our  results 
are  insensitive  to  this  assumption. 

2. 1 .  Angular  Selection  Function 

The  angular  selection  function  n(r)  is  shown  in  Figure  1 .  For 
the  baseline  sample,  it  covers  a  sky  area  of  2499  deg2.  The 
function  n{r)  is  defined  to  be  the  completeness,  i.e.,  the  prob¬ 
ability  that  a  galaxy  satisfying  the  sample  cuts  actually  gets 
assigned  a  redshift  (including  the  6%  of  the  total  that  are  de¬ 
termined  on  the  basis  of  the  nearest  neighbor  redshift  as  de¬ 
scribed  in  Appendix  A).  Therefore,  the  completeness  is  a 
dimensionless  number  between  0  and  1 .  The  effective  area  is 
J«(r)  rfft  «  2417  deg2,  corresponding  to  an  average  com¬ 
pleteness  of  96.7%.  As  detailed  in  Appendix  A. 2,  we  model 
h(r)  as  a  piecewise  constant  function.  We  specify  this  function 
by  giving  its  value  in  each  of  a  large  number  of  disjoint 
spherical  polygons,  within  each  of  which  it  takes  a  constant 
value.  There  are  2914  such  polygons  for  the  baseline  sample, 


encoding  the  geometric  boundaries  of  spectroscopic  tiles, 
holes,  and  other  relevant  entities.  Figure  1  shows  that  the  sky 
coverage  naturally  separates  into  three  fairly  compact  regions 
of  comparable  size:  north  of  the  Galactic  plane  (in  the  center  of 
the  figure),  there  is  one  region  on  the  celestial  equator  and 
another  at  high  declination;  south  of  the  Galactic  plane,  there  is 
a  set  of  three  stripes  near  the  equator.  For  the  purpose  of  testing 
for  systematic  errors,  we  define  angular  subsamples  Al,  A3, 
and  A4  corresponding  to  these  regions  (see  Table  1),  which 
have  effective  areas  of  809,  1007,  and  600  deg2,  respectively. 

2.2.  Radial  Selection  Function 

Our  estimate  of  the  radial  selection  function  for  the  baseline 
sample  is  shown  in  Figures  2  and  3,  together  with  a  histogram 
of  the  galaxy  distances.  The  full  details  of  the  derivation  of  the 
radial  selection  function  can  be  found  in  Appendix  A. 4,  in¬ 
cluding  both  evolution  and  Tv-corrections.  Our  basic  sample 
has  magnitude  limits  r  >  14.5  at  the  bright  end  (since  the 
survey  becomes  incomplete  for  bright  galaxies  with  large 
angular  size)  and  r  <  17.5  at  the  faint  end  (Appendix  A. 4). 

Figure  4  shows  all  galaxies  within  5°  of  the  equator  in  a  pie 
diagram,  color  coded  by  their  absolute  magnitude,  and  illus¬ 
trates  one  of  the  fundamental  challenges  for  our  project  (and 
indeed  for  the  analysis  of  any  flux-limited  sample):  luminous 
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Fig.  1. — Top:  Angular  completeness  map,  the  relative  probabilities  that  galaxies  in  various  directions  get  included,  in  Hammer-Aitoff  projection  in  equatorial 
coordinates  on  a  gray  scale  ranging  from  black  (0)  to  white  (1).  It  is  this  completeness  map  that  we  expand  in  spherical  harmonics.  The  backdrop  is  the  logarithm  of 
the  dust  map  from  Schlegel  et  al.  (1998),  indicating  which  sky  regions  are  most  likely  to  be  affected  by  extinction-related  systematic  errors.  [See  the  electronic 
edition  of  the  Journal  for  a  color  version  of  this  figure .]  Bottom:  Complex  nature  of  the  completeness  map  and  the  high  average  completeness  with  a  zoom  of  a  small 
sky  region. 


galaxies  dominate  the  sample  at  large  distances,  and  dim  ones 
dominate  nearby.  A  measurement  of  the  power  spectrum  on 
very  large  scales  is  therefore  statistically  dominated  by  lumi¬ 
nous  galaxies,  whereas  a  measurement  on  small  scales  is 
dominated  by  dim  ones  (since  they  have  much  higher  number 
density).  Yet  it  is  well  known  that  luminous  galaxies  cluster 
more  than  dim  ones  (e.g.,  Davis  et  al.  1988;  Hamilton  1988; 
Norberg  et  al.  2001;  Zehavi  et  al.  2002;  Verde  et  al.  2003),  so 
when  comparing  P(k)  on  large  and  small  scales  we  are  in 
effect  comparing  apples  with  oranges  and  may  mistakenly 
conclude  that  the  power  spectrum  is  red-tilted  (with  a  spectral 
index  n  =  0.94,  say)  even  if  the  truth  is  n  =  1.  Thus  far,  no 
magnitude-limited  galaxy  power  spectrum  analysis  has  been 
corrected  for  this  effect.  We  do  so  in  §  5.  Although  this  effect 


is  not  large  in  an  absolute  sense,  we  find  that  it  must  none¬ 
theless  be  included  for  precision  cosmology  applications. 

The  first  step  is  to  quantify  the  luminosity  dependence  of 
bias.  For  this  purpose,  we  define  a  series  of  volume-limited 
samples  as  specified  in  Table  1,  constructed  by  discarding  all 
galaxies  that  are  too  faint  to  be  included  at  the  far  limit  or  too 
bright  to  be  included  at  the  near  limit.  This  gives  a  radial 
selection  function  (Fig.  3)  that  is  constant  within  the  radial 
limits  and  zero  elsewhere.  The  radial  limits  are  set  so  that 
galaxies  at  the  far  (near)  radial  limit  and  the  dim  (luminous) 
end  of  the  absolute  magnitude  range  in  question  have  fluxes  at 
the  faint  (bright)  flux  limit.  Because  the  flux  range  14.5  <  r  < 
17.5  spans  exactly  3  mag,  these  subsamples  overlap  spatially 
only  with  their  nearest  neighbor  samples  and  have  a  near  limit 
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Fig.  2. — Redshift  distribution  of  the  galaxies  in  sample  safe  13  shown  as  a 
histogram  and  compared  with  the  expected  distribution  in  the  absence  of 
clustering,  In  10 h(r)r3  dfl  ( solid  curve)  in  comoving  coordinates  assuming 
a  flat  Ha  =  0.7  cosmology.  Bottom:  Ratio  of  observed  and  expected  dis¬ 
tributions.  The  vertical  lines  indicate  the  redshift  limits  (50  Mpc  <  r  < 
600  h~l  Mpc)  employed  in  the  baseline  analysis.  This  near  cut  removes  only 
22  galaxies,  the  far  cut,  3295.  [See  the  electronic  edition  of  the  Journal  for  a 
color  version  of  this  figure.] 


Comoving  distance  r  [h-1  Mpc] 


Fig.  3. — Same  as  Fig.  2  but  plotted  as  comoving  number  density.  The  gray 
(background)  histogram  shows  the  full  flux-limited  sample,  and  the  others 
show  the  volume-limited  subsamples,  with  lines  indicating  their  predicted 
constant  selection  functions.  [See  the  electronic  edition  of  the  Journal  for  a 
color  version  of  this  figure .] 


Fig.  4. — Distribution  of  67,676  galaxies  within  5°  of  the  equatorial  plane, 
color  coded  by  their  absolute  magnitudes.  Mr  in  the  figure  refers  to  the  ab¬ 
solute  r-magnitude  ^-corrected  to  z  =  0.1.  [See  the  electronic  edition  of  the 
Journal  for  a  color  version  of  this  figure.] 

that  would  be  equal  to  the  far  limit  of  the  sample  that  is  two 
notches  more  luminous  if  it  were  not  for  evolution  and 
K-corrections.  This  is  clear  in  Figures  5  and  6.  These  samples 
have  the  advantage  that  the  measured  clustering  is  that  of  a 
well-defined  set  of  objects  whose  selection  is  redshift  inde¬ 
pendent.  Although  we  have  not  accounted  for  our  surface 
brightness  limits  in  defining  these  samples,  very  few  of  even 
the  lowest  luminosity  galaxies  in  our  sample  are  affected  by 
the  surface  brightness  limits  of  the  survey  (Blanton  et  al. 
2003d;  Strauss  et  al.  2002). 

3.  METHOD  AND  BASIC  ANALYSIS 

We  now  turn  to  our  basic  goal:  accurately  measuring  the 
shape  of  the  matter  power  spectrum  P(k)  on  large  scales  by 
using  the  data  described  above,  i.e.,  measuring  a  curve  that 
equals  the  large-scale  matter  power  spectrum  up  to  an  un¬ 
known  overall  multiplicative  bias  factor  that  is  independent  of 
scale.  This  involves  four  basic  challenges:  (1)  accounting  for 
the  complicated  survey  geometry,  (2)  correcting  for  the  effect 
of  redshift-space  distortions,  (3)  correcting  for  bias  effects, 
which  as  described  in  §  5  cause  an  artificial  red-tilt  in  the 
power  spectrum,  and  (4)  checking  for  potential  systematic 
errors.  Before  delving  into  detail,  let  us  summarize  each  of 
these  challenges  and  how  we  will  tackle  them. 

3.1.  Battle  Plan 

3.1.1.  Survey  Geometry  and  Method  of  Estimating  Power 

It  is  well  known  that  since  galaxies  in  a  redshift  survey 
probe  the  underlying  density  field  only  in  a  finite  volume,  the 
power  spectrum  estimated  with  traditional  Fourier  methods 
(e.g.,  Percival  et  al.  2001;  Feldman,  Kaiser,  &  Peacock  1994) 
is  complicated  to  interpret:  it  corresponds  to  a  smeared  out 
version  of  the  true  power  spectrum,  can  underestimate  power 
on  the  largest  scales  because  of  the  so-called  integral  con¬ 
straint  (Peacock  &  Nicholson  1991),  and  has  correlated  errors. 


No.  2,  2004 


3D  POWER  SPECTRUM  OF  GALAXIES  FROM  SDSS 


707 


-500  0  500 

Comoving  x  [h_1  Mpc]  (towards  ra=0) 


>> 

G 


O 

B 

o 


-500  0  500 

Comoving  x  [h-1  Mpc]  (towards  ra=0) 


Fig.  5. — Distribution  of  galaxies  within  5°  of  the  equatorial  plane  for  the 
volume-limited  subsamples  LI,  L3,  L5,  and  L7  from  Table  1.  [See  the  elec¬ 
tronic  edition  of  the  Journal  for  a  color  version  of  this  figure .] 


Fig.  6. — Distribution  of  galaxies  within  5°  of  the  equatorial  plane  for  the 
remaining  four  volume-limited  subsamples  from  Table  1,  i.e.,  L2,  L4,  L6,  and 
L8.  [See  the  electronic  edition  of  the  Journal  for  a  color  version  of  this  figure .] 


We  therefore  measure  power  spectra  with  an  alternative, 
matrix-based  approach,  which,  although  more  numerically 
demanding,  has  several  advantages  on  the  large  scales  that  are 
the  focus  of  this  paper.  It  facilitates  tests  for  radial  and  angular 
systematic  errors.  If  galaxies  were  faithful  tracers  of  mass, 
then  it  would  produce  unbiased  minimum-variance  power 
spectrum  measurements  with  uncorrelated  error  bars  that  are 
smaller  than  those  from  traditional  Fourier  methods.  The 
power  smearing  is  quantified  by  window  functions  that  are 
both  narrower  than  with  traditional  Fourier  methods  and  can 
be  computed  without  need  for  approximations  or  Monte 
Carlo  simulations.  (We  do,  however,  use  Monte  Carlo  simu¬ 
lations  in  §  4  to  verify  that  the  method  and  software  work 
as  advertised.) 

3.1.2.  Redshift-Space  Distortions 

Our  basic  input  data  consist  of  galaxy  positions  r  in  three- 
dimensional  “redshift  space,”  where  the  comoving  distance 
r  =  |r|  is  that  which  would  explain  the  observed  redshift  if  the 
galaxy  were  merely  following  the  Hubble  flow  of  the  expanding 
universe.  The  same  gravitational  forces  that  cause  galaxies  to 
cluster  also  cause  them  to  move  relative  to  the  Hubble  flow,  and 
these  so-called  peculiar  velocities  make  the  clustering  in  red¬ 
shift  space  anisotropic  (Kaiser  1987;  Hamilton  1998).  Al¬ 
though  this  effect  can  be  modeled  and  accounted  for  exactly  on 
very  large  scales  on  which  the  clustering  is  linear  (Kaiser  1 987), 
nonlinear  corrections  cannot  be  neglected  on  some  of  the  scales 
of  interest  to  us  (Scoccimarro  et  al.  2001;  Seljak  2001;  Cole, 
Fisher,  &  Weinberg  1994;  Hatton  &  Cole  1998).  Section  4  is 
devoted  to  dealing  with  this  complication,  going  beyond  the 
Kaiser  approximation  with  a  three-pronged  approach: 

1.  We  precede  our  power  spectrum  analysis  by  a  nonlinear 
”finger-of-God”  compression  step  with  a  tunable  threshold,  to 
quantify  the  sensitivity  of  our  results  to  nonlinear  galaxy 
groups  and  clusters. 


2.  We  measure  three  power  spectra  (galaxy-galaxy,  galaxy- 
velocity,  and  velocity-velocity  spectra)  rather  than  one,  quan¬ 
tifying  the  clustering  anisotropy  and  allowing  the  real-space 
power  to  be  reconstructed  beyond  linear  order. 

3.  We  analyze  an  extensive  set  of  mock  galaxy  catalogs  to 
quantify  the  accuracy  of  our  results  and  measure  how  the 
nonlinear  correction  factor  grows  toward  smaller  scales. 

Our  mock  analysis  will  also  allow  us  to  quantify  the  effects  of 
nonlinear  clustering  on  the  error  bars  and  band-power  corre¬ 
lations.  Step  1  is  optional,  and  we  present  results  both  with 
and  without  it. 

3.1.3.  Bias 

All  we  can  ever  measure  with  galaxy  redshift  surveys  is 
galaxy  clustering,  whereas  what  we  care  about  for  con¬ 
straining  cosmological  models  is  the  clustering  of  the  under¬ 
lying  matter  distribution.  Our  ability  to  do  cosmology  with 
the  real-space  galaxy  power  spectrum  Pgg{k)  is  therefore  only 
as  good  as  our  understanding  of  bias,  i.e.,  the  relation  of 
Pgg(k)  to  the  matter  power  spectrum  P(k).  Pessimists  have 
often  argued  that  since  we  do  not  understand  galaxy  forma¬ 
tion  at  high  precision,  we  cannot  understand  bias  accurately 
either,  so  galaxy  surveys  will  be  relegated  to  a  historical 
footnote,  having  no  role  to  play  in  the  precision  cosmology 
era.  Optimists  retort  that  no  matter  how  complicated  the 
gasdynamical  and  radiative  processes  involved  in  galaxy 
formation  may  be,  they  have  only  a  finite  spatial  range  (a  few 
hr1  Mpc,  say),  leading  to  a  generic  prediction  that  bias  on 
much  larger  scales  (>20  h  Mpc,  say)  should  be  scale  in¬ 
dependent  for  any  particular  type  of  galaxy  (Coles  1993; 
Fry  &  Gaztanaga  1993;  Scherrer  &  Weinberg  1998;  Mann, 
Peacock,  &  Heavens  1998;  Coles,  Melott,  &  Munshi  1999; 
Heavens,  Matarrese,  &  Verde  1998;  Blanton  et  al.  2000; 
Narayanan,  Berlind,  &  Weinberg  2000).  This  theoretical  ex¬ 
pectation  is  supported  by  visual  inspection  of  the  galaxy 
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distribution.  Comparing  early-  and  late-type  galaxies  in  the 
2dFGRS  (Peacock  2003;  Madgwick  et  al.  2003)  shows  that 
whereas  the  small-scale  distribution  differs  (elliptical  gal¬ 
axies  display  a  more  “skeletal”  distribution  than  do  cluster- 
shunning  spiral  galaxies),  their  large-scale  clustering  patterns 
are  indistinguishable. 

We  devote  §  5  to  the  bias  issue,  arguing  that  both  the  pes¬ 
simists  and  the  optimists  have  turned  out  to  be  right:  yes, 
biasing  is  indeed  complicated  on  small  scales  (where  the 
galaxy  power  spectrum  will  therefore  teach  us  more  about 
galaxy  formation  than  about  cosmology),  but  no,  this  in  no 
way  prevents  us  from  doing  precision  cosmology  with  the 
galaxy  power  spectrum  on  very  large  scales.  Our  main  tool  is 
analyzing  our  volume-limited  magnitude  subsamples,  show¬ 
ing  that  their  large-scale  power  spectra  are  consistent  with  all 
having  the  same  shape  and  differing  merely  in  amplitude. 

Although  the  argument  above  for  scale-independent  bias 
holds  only  for  a  volume-limited  subsample,  we  wish  to  use 
our  full  galaxy  sample  over  a  broad  range  of  redshifts,  both  to 
expand  the  range  of  A-scales  probed  and  to  reduce  shot  noise. 
We  therefore  use  our  measured  luminosity  dependence  of 
bias  to  compute  and  remove  the  artificial  red-tilt  in  our  full 
magnitude-limited  baseline  sample. 

In  future  papers,  we  will  constrain  galaxy  bias  empirically 
by  using  clustering  measurements  on  smaller  scales  (e.g., 

I.  Zehavi  et  al.  2004,  in  preparation),  which  will  allow  us  to 
calculate  the  effects  of  sca/e-dependent  bias  on  the  power 
spectrum  in  the  nonlinear  regime  and  thus  to  extend  the 
measurement  of  the  matter  power  spectrum  shape  to  smaller 
scales. 

3.1.4.  Systematic  Errors 

As  the  old  saying  goes,  the  devil  you  know  poses  a  lesser 
threat  than  the  devil  you  do  not.  We  therefore  devote  §  6  to 
testing  for  the  sort  of  effects  that  are  not  included  in  our  Monte 
Carlo  simulations.  This  includes  both  radial  modulations  (due 
to  misestimates  of  evolution  or  K-corrections)  and  angular 
modulations  (due  to  effects  such  as  uncorrected  dust  extinction, 
variable  observing  conditions,  photometric  calibration  errors, 
and  fiber  collisions).  Our  tests  use  two  basic  approaches: 

1.  Analyzing  subsets  of  galaxies. — We  compare  the  power 
spectra  from  different  parts  of  the  sky  (subsamples  A1-A4 
from  Table  1)  and  different  distance  ranges  (subsamples  Rl- 
R3)  looking  for  inconsistencies. 

2.  Analyzing  subsets  of  modes. — We  look  for  excess  power 
in  purely  angular  and  purely  radial  modes  of  the  density  field, 
which  act  like  lightning  rods  for  angular  and  radial  modulations 
such  as  those  mentioned  above. 

3.2.  Three-Step  Power  Spectrum  Estimation 

Our  matrix-based  power  spectrum  estimation  approach  is 
described  in  Tegmark  et  al.  (1998).  It  starts  by  expanding  the 
galaxy  density  field  in  a  set  of  functions  known  as  pseudo- 
Karhunen-Loeve  eigenmodes.  This  step  compresses  the  data 
set  into  a  much  smaller  size  (from  hundreds  of  thousands  of 
galaxy  coordinates  to  a  few  thousand  expansion  coefficients) 
while  retaining  the  large-scale  cosmological  information  in 
which  we  are  interested.  It  also  reduces  the  power  spectrum 
estimation  problem  to  a  mathematical  form  equivalent  to  that 
encountered  in  CMB  analysis,  enabling  us  to  take  advantage 
of  a  powerful  set  of  matrix-based  tools  that  have  been  fruit¬ 
fully  employed  in  the  CMB  field.  Our  basic  analysis  in  the 


remainder  of  this  section  therefore  consists  of  the  following 
three  steps: 

1.  Finger-of-God  compression  to  remove  redshift-space 
distortions  due  to  virialized  structures  (§  3.3). 

2.  Pseudo-Karhunen-Loeve  eigenmode  expansion  (§  3.4). 

3.  Power  spectrum  estimation  using  quadratic  estimators 
(§3.6). 

As  mentioned,  the  third  step  measures  not  one  but  three  power 
spectra,  encoding  clustering  anisotropy  that  contains  infor¬ 
mation  about  redshift-space  distortions.  In  this  section,  we 
merely  present  the  basic  measurement  of  these  three  curves, 
which  involves  no  assumptions  about  linearity,  the  nature  of 
biasing,  or  anything  else.  We  then  return  to  modeling  and 
interpreting  these  curves  in  terms  of  real-space  power  in  §  4 
and  to  bias  modeling  in  §  5. 

3.3.  Step  1:  Finger-of-God  Compression 

Since  our  analysis  method  is  motivated  by  (although  not 
limited  to)  the  linear  Kaiser  approximation  for  redshift-space 
distortions,  it  is  crucial  that  we  be  able  to  empirically  quantify 
our  sensitivity  to  the  so-called  finger-of-God  (FOG)  effect, 
whereby  radial  velocities  in  virialized  clusters  make  them 
appear  elongated  along  the  line  of  sight.  We  therefore  start  our 
analysis  by  compressing  (isotropizing)  FOGs,  as  illustrated  in 
Figure  7.  The  FOG  compression  involves  a  tunable  threshold 
density,  and  in  §  4  we  study  how  the  final  results  change  as  we 
gradually  change  this  threshold  to  include  lesser  or  greater 
numbers  of  FOGs. 

We  use  a  standard  friends-of-friends  algorithm,  in  which 
two  galaxies  are  considered  friends,  therefore  in  the  same 
cluster,  if  the  density  windowed  through  an  ellipse  10  times 
longer  in  the  radial  than  transverse  direction,  centered  on  the 
pair,  exceeds  a  certain  overdensity  threshold.  To  avoid  linking 
well-separated  galaxies  in  deep,  sparsely  sampled  parts  of  the 
survey,  we  impose  the  additional  constraint  that  friends  should 
be  closer  than  rj_  max  =  5  h~l  Mpc  in  the  transverse  direction. 
The  two  conditions  are  combined  into  the  following  single 
criterion:  two  galaxies  separated  by  ry  in  the  radial  direction 
and  by  r  in  the  transverse  direction  are  considered  friends  if 

r/r„x2  l1/2  r  4  1-1/3 

(iq)  +  rl  <  y  7T«(  1  +  bc)  +  7j_  max  ,  (1) 

where  h  is  the  three-dimensional  selection  function  at  the 
position  of  the  pair  and  Sc  is  an  overdensity  threshold.  Note 
that  Sc  represents  not  the  overdensity  of  the  pair  as  seen  in 
redshift  space  but  rather  the  overdensity  of  the  pair  after  their 
radial  separation  has  been  reduced  by  a  factor  of  10.  Thus,  8C 
is  intended  to  approximate  the  threshold  overdensity  of  a 
cluster  in  real  space.  We  have  chosen  r±  max  somewhat  larger 
than  the  virial  diameter  of  typical  clusters  to  be  conservative, 
minimizing  the  risk  of  missing  FOGs — for  our  baseline 
threshold  6C  =  200,  our  results  are  essentially  unaffected  by 
this  choice  of  r±  max.  Having  identified  a  cluster  by  friends-of- 
friends  in  this  fashion,  we  measure  the  dispersion  of  galaxy 
positions  about  the  center  of  the  cluster  in  both  radial  and 
transverse  directions.  If  the  one-dimensional  radial  dispersion 
exceeds  the  transverse  dispersion,  then  the  cluster  is  deemed  a 
FOG,  and  the  FOG  is  then  compressed  radially  so  that  the 
cluster  becomes  round,  that  is,  the  transverse  dispersion 
equals  the  radial  dispersion.  We  perform  the  entire  analysis 
5  times,  using  density  cutoffs  1  +  6C  =  oo,  200,  100,  50,  and 
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Fig.  7. — Effect  of  our  FOG  removal  shown  in  part  of  the  equatorial  slice  |<$|  <  5°  (left)  before  and  (right)  after  FOG  compression.  Top,  All  67,626  galaxies  in  the 
slice;  bottom ,  28,255  that  are  identified  as  belonging  to  FOGs  (with  density  threshold  200). 


25,  respectively;  in  our  analyses  below,  we  explore  the  sen¬ 
sitivity  of  our  results  to  this  cutoff.  The  infinite  threshold 
1+5C  =  oo  corresponds  to  no  compression  at  all. 

Figure  7  illustrates  FOG  compression  with  threshold  den¬ 
sity  l+6c  =  200,  and  unless  explicitly  stated  otherwise,  all 
results  presented  in  this  paper  are  for  this  threshold  density. 
We  make  this  choice  to  be  on  the  safe  side:  Bryan  &  Norman 
(1998)  estimate  that  the  overdensity  of  a  cluster  at  virialization 
is  about  337  in  a  ACDM  model,  rising  further  as  the  universe 
expands  and  the  background  density  continues  to  drop. 

3.4.  Step  2:  Pseudo-KL  Pixelization 

The  raw  data  consist  of  three-dimensional  vectors  ra , 
a  =  1,  .  .  .  ,  fVgaj,  giving  the  measured  positions  of  each  galaxy 
in  redshift  space,  with  the  number  of  galaxies  Aga j  given  in 
Table  1  for  each  sample.  As  in  Tegmark  et  al.  (1998),  we 
expand  the  observed  three-dimensional  density  field  in  a  basis 
of  Nx  noise-orthonormal  functions  i  =  1 ,  .  .  .  ,  Nx, 

XlSJw)*‘{r)  dy  (2) 


refer  to  the  functions  '</),•  as  “modes.”  As  we  see  below,  these 
modes  play  a  role  quite  analogous  to  pixels  in  CMB  maps, 
with  the  variance  of  xt  depending  linearly  on  the  power 
spectrum  that  we  wish  to  measure. 

Galaxies  are  (from  a  cosmological  perspective)  ^-functions 
in  space,  so  the  integral  in  equation  (2)  reduces  to  a  discrete 
sum  over  galaxies.  We  do  not  rebin  the  galaxies  spatially, 
which  would  artificially  degrade  the  resolution.  It  is  conve¬ 
nient  to  isolate  the  mean  density  into  a  single  mode  ipi{r)  = 
n(r),  with  amplitude 

x\  =  J  n(r)  d3r  =  Ngal,  (3) 

and  to  arrange  for  all  other  modes  to  have  zero  mean 

{xi)  =  J  ip i(r )  d\  =  0,  (/VI).  (4) 

The  covariance  matrix  of  the  vector  x  of  amplitudes  is  a  sum 
of  noise  and  signal  terms 


and  work  with  the  A, -dimensional  data  vector  x  of  expansion  (AxAx‘)  =  C  =  N  +  S, 

coefficients  instead  of  the  3  x  Aga i  numbers  ra .  Here  n  is 

the  three-dimensional  selection  function  described  in  §  2;  i.e.,  .  .  .  .  . 

Mr)  dV  is  the  expected  (not  the  observed)  number  of  galaxies  where  the  shot-nolse  covariance  matrix  is  given  by 

in  a  volume  dV  about  r  in  the  absence  of  clustering,  and  the 
integration  is  carried  out  over  the  volume  V  of  the  sample 
where  the  selection  function  h(r)  is  nonzero.  We  frequently 


n{r) 


(5) 

(6) 
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Fig.  8. — Sample  of  four  angular  pseudo-KL  (PKL)  modes  are  shown  in 
Hammer- Aitoff  projection  in  equatorial  coordinates,  with  gray  representing 
zero  weight  and  lighter/darker  shades  indicating  positive/negative  weight, 
respectively.  Uniform  gray  areas  are  outside  the  observed  region.  From  top  to 
bottom,  they  are  angular  modes  1  (the  mean  mode),  3,  35,  and  286  and  are 
seen  to  probe  progressively  smaller  angular  scales.  [See  the  electronic  edition 
of  the  Journal  for  a  color  version  of  this  figure .] 

and  the  signal  covariance  matrix  is 

Sy=  f  Mk)MkYpUk)  7^4-  (7) 

J  (2n) 

in  the  absence  of  redshift-space  distortions  (which  are 
included  in  §  3.5).  Here  hats  denote  Fourier  transforms, 
and  the  asterisk  denotes  complex  conjugation.  Pgg(k )  is  the 
(real-space)  galaxy  power  spectrum,  which  for  a  random 
field  of  density  fluctuations  6(r)  is  defined  by  (6(k)6(k')*)  = 


(27r)3(>Dirac {k  —  k')Pgg{k).  We  take  the_  functions  ipj  to  have 
units  of  inverse  volume,  so  x,  N,  S,  and  tp, are  all  dimensionless. 

Our  method  requires  computing  the  signal  covariance 
matrix  S ,  both  to  calculate  power  spectrum  error  bars  and  to 
find  the  power  spectrum  estimator  that  minimizes  them. 
Equation  (7)  shows  that  this  requires  assuming  a  power 
spectrum  Pgg(k).  For  this  spectrum,  which  we  refer  to  as  our 
“prior,”  we  use  a  simple  two-parameter  fit  as  described  in 
§  B.4.1,  whose  parameters  are  determined  from  our  mea¬ 
surements  by  iterating  the  entire  analysis. 

As  our  functions  ipi(r),  we  use  the  pseudo-Karhunen- 
Loeve  (PKL)  eigenmodes  defined  in  Hamilton,  Tegmark,  & 
Padmanabhan  (2000,  hereafter  HTP00)  and  Tegmark  et  al. 
(2002a,  hereafter  THX02).  The  construction  of  these  PKL 
modes  tpi(r)  explicitly  uses  the  three-dimensional  selection 
function  h(r)  but  is  model  independent  since  it  does  not  de¬ 
pend  on  the  power  spectrum. 

To  provide  an  intuitive  feel  for  the  nature  of  these  modes, 
a  sample  is  plotted  in  Figures  8  and  9.  We  use  these  modes 
because  they  have  the  following  desirable  properties: 

1.  They  form  a  complete  set  of  basis  functions  probing 
successively  smaller  scales,  so  that  a  finite  number  of  them  (we 
use  the  first  4000,  for  the  reasons  given  in  §  B.4.2)  allow 
essentially  all  information  about  the  density  field  on  large 
scales  to  be  distilled  into  the  vector  x. 

2.  They  are  orthonormal  with  respect  to  the  shot  noise,  i.e., 
such  that  equation  (6)  gives  N  =  /,  the  identity  matrix.  The 
construction  of  the  modes  thus  depends  explicitly  on  the  survey 
geometry  as  specified  by  n{r),  and  ipi{r)  =  0  in  regions  of 
space  where  n(r)  =  0. 

3.  They  allow  the  covariance  matrix  S  to  be  fairly  rapidly 
computed. 

4.  They  are  the  product  of  an  angular  and  a  radial  part;  i.e., 
they  take  the  separable  form  ipi(r)  =  ipj(r)ipj(r),  which  accel¬ 
erates  numerical  computations  and  helps  isolate  radial  and 
angular  systematic  problems. 

5.  A  range  of  potential  sources  of  systematic  problems  are 
isolated  into  special  modes  that  are  orthogonal  to  all  other 
modes.  This  means  that  we  can  test  for  the  presence  of  such 
problems  by  looking  for  excess  power  in  these  modes  and 
immunize  against  their  effects  by  discarding  these  modes. 

We  have  four  types  of  such  special  modes: 

1.  The  very  first  mode  is  the  mean  density,  ip i(r)  =  h(r). 
The  mean  mode  is  used  in  determining  the  maximum  likeli¬ 
hood  normalization  of  the  selection  function  but  is  then  dis¬ 
carded  from  the  analysis,  since  it  is  impossible  to  measure  the 
fluctuation  of  the  mean  mode.  The  idea  of  solving  the  so-called 
integral  constraint  problem  by  making  all  modes  orthogonal  to 
the  mean  (eq.  [4])  goes  back  to  Fisher  et  al.  (1993). 

2.  Modes  2-8  are  associated  with  the  motion  of  the  Local 
Group  through  the  CMB  at  622  km  s  1  toward  (B1950  FK4) 
R.A.  =  162°,  decl.  =  —27°  (Lineweaver  et  al.  1996;  Courteau 
&  van  den  Bergh  1999).  To  first  order,  these  modes  are  the 
only  modes  affected  by  misestimates  of  the  motion  of  the  Local 
Group. 

3.  Purely  radial  modes  (e.g.,  mode  468  in  Fig.  9)  are  to  first 
order  the  only  ones  affected  by  misestimates  of  the  radial  se¬ 
lection  function  h[r). 

4.  Purely  angular  modes  (e.g.,  mode  859  in  Fig.  9)  are  to 
first  order  the  only  ones  affected  by  misestimates  of  the  angular 
selection  function  n(r),  as  may  result  from  inadequate  correc¬ 
tions  for,  e.g.,  extinction,  the  variable  magnitude  limit,  the 
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Fig.  9. — Sample  of  six  pseudo-KL  modes  shown  in  the  equatorial  plane.  Gray  represents  zero  weight,  and  lighter/darker  shades  indicate  positive/negative  weight, 
respectively.  Uniform  gray  areas  are  outside  the  volume  used.  Bottom:  Examples  of  special  modes,  (left)  a  purely  radial  mode  and  (right)  a  purely  angular  mode. 
[j See  the  electronic  edition  of  the  Journal  for  a  color  version  of  this  figure .] 


variable  magnitude  completeness,  or  photometric  zero-point 
offsets. 

The  computation  of  the  modes  in  practice  is  described  in  detail 
in  THX02  and  in  even  more  detail  in  Hamilton  &  Tegmark 
(2003). 

The  pixelized  data  vector  x  is  shown  in  Figure  10,  and  its 
distribution  is  shown  in  Figure  11.  This  data  compression  step 
has  thus  distilled  the  large-scale  information  about  the  galaxy 
density  field  from  2>N„a\  =  429, 942  galaxy  coordinates  into 
4000  PKL  coefficients.  The  order  of  these  coefficients  is  one 
of  decreasing  scale  (increasing  k)  as  is  shown  in  Figure  12.  If 
there  were  no  cosmological  density  fluctuations  in  the  survey, 
merely  Poisson  fluctuations,  the  PKL  coefficients  x,  would  be 
uncorrelated  with  unit  variance  (since  N  =  /),  so  about  68% 
of  them  would  be  expected  to  lie  within  the  dark  gray  band. 


Figure  10  shows  that  the  fluctuations  are  considerably  larger 
than  Poisson,  especially  for  the  largest  scale  modes  (to  the 
left),  demonstrating  the  obvious  fact  that  cosmological  density 
fluctuations  are  present,  as  expected. 

3.5.  What  We  Wish  To  Measure:  Three  Power  Spectra,  Not  One 

Following  HTP00  and  THX02,  we  measure  three  separate 
power  spectra,  whose  ratios  encode  information  about  clus¬ 
tering  anisotropy  due  to  redshift-space  distortions.  Let  us  now 
give  their  definition  and  some  intuition  for  how  to  interpret 
them. 

On  large  scales  where  redshift-space  distortions  can  be 
treated  in  the  linear  approximation  (Kaiser  1987),  the  signal 
covariance  matrix  S  in  equation  (5)  can  be  generalized  from 
equation  (7)  and  written  in  the  form 
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Fig.  10. — Four  thousand  elements  X;  ( triangles )  of  the  data  vector  x  (the 
pseudo-KL  expansion  coefficients)  for  the  baseline  galaxy  sample.  If  there  were 
no  clustering  in  the  survey,  merely  shot  noise,  they  would  have  unit  variance, 
and  about  68%  of  them  would  lie  within  the  dark  gray  band.  If  our  prior  power 
spectrum  were  correct,  then  the  standard  deviation  would  be  larger,  as  indicated 
by  the  shaded  light  gray  band.  To  reduce  clutter,  the  modes  are  (apart  from  the 
first  eight  special  modes)  ordered  by  decreasing  signal-to-noise  ratio,  which 
corresponds  approximately  to  the  ordering  by  scale  in  Fig.  12.  [See  the  elec¬ 
tronic  edition  of  the  Journal  for  a  color  version  of  this  figure .] 


f°°  k 2  dk 

S=  [Pgg(k)Pgg(k)  +  Pgv{k)Pgv(k)  +  Pw(i)Pw(t)]  — — y  5 

Jo  (z7 r) 

(8) 

where  Pgg{k),  Pgv(k),  and  Pvv(k)  are  three  power  spectra  de¬ 
fined  in  real  space  (as  opposed  to  redshift  space)  and  Pgg{k), 


*/<#* 


Fig.  11. — Gaussianity  of  fluctuations.  The  histogram  shows  the  distribution 
of  the  4000  PKL  coefficients  x,  from  the  previous  figure  after  dividing  by  their 
predicted  standard  deviation  Cj/2,  assuming  our  prior  power  spectrum.  The 
histogram  has  been  normalized  to  have  unit  area.  The  solid  curve  shows  a 
Gaussian  of  unit  variance,  zero  mean,  and  unit  area.  [See  the  electronic  edition 
of  the  Journal  for  a  color  version  of  this  figure .] 


Pgv(k),  and  Pm(k)  are  known  dimensionless  matrix-valued 
functions.  We  refer  to  these  three  power  spectra  as  the  galaxy- 
galaxy  power,  the  galaxy-velocity  power,  and  the  velocity- 
velocity  power,  respectively,  or  gg,  gv,  and  vv  for  short. 
Specifically,  Pgg(k)  is  the  real-space  galaxy  power  spectrum, 
Pvv(k )  is  the  velocity  power  spectrum,  and  Pgv(k )  is  the  cross 
power  between  galaxies  and  velocity.  More  rigorously,  “ve¬ 
locity”  here  refers  to  minus  the  velocity  divergence  V-u, 
which  in  linear  theory  is  related  to  the  mass  (not  galaxy) 
overdensity  6  by  f&  +  V-v  =  0,  where  V  denotes  the  comov¬ 
ing  gradient  in  velocity  units.  Here  /  ss  is  the  dimen¬ 
sionless  growth  rate  for  linear  density  perturbations  (see 
Hamilton  2001).  The  three  matrix-valued  functions  are  de¬ 
termined  directly  from  the  modes  P;,  i.e.,  by  geometry  alone: 


II 

Ts 

j  $,■(*)$,•(*)*  dVLk. 

(9) 

(P9v)ij  = 

J 

[  [h(k)Q(ky  +  6  (k)i-(k)*]  dnk, 

(10) 

(Pn>)iJ  = 

J 

[  0  (k)Q(k)*dnk, 

(ID 

in  which  the  velocity  mode  Q(k)  is  related  to  the  position 
mode  'ij)i(k)  by  (Fisher,  Scharf,  &  Lahav  1994;  Heavens  & 
Taylor  1995;  Hamilton  1998,  eq.  [8.13]) 

0  =M^i,  (12) 

where  M'  is  the  Hermitian  conjugate  of  the  velocity  part  of  the 
linear  redshift  distortion  operator.  In  the  small-angle,  or  distant 


1  10  100  1000 


Mode  number  i 

Fig.  12. — Relation  between  mode  number  i  and  physical  scale  k.  The 
variance  (x?)  gets  nonnegative  contributions  from  all  k  as  per  eq.  (8).  Fifty 
percent  of  the  contribution  comes  from  below  the  solid  black  curve,  which  we 
can  interpret  as  the  median  lvalue  probed  by  the  zth  mode.  The  shaded 
regions  show  percentiles  of  the  contribution:  from  outside  in,  they  show  the 
^-ranges  giving  99.98%,  99.8%,  98%,  80%,  and  60%  of  the  contribution, 
respectively.  Apart  from  the  first  eight  special  modes,  the  modes  are  ordered 
by  increasing  median  lvalue.  [See  the  electronic  edition  of  the  Journal  for  a 
color  version  of  this  figure.] 
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observer,  approximation,  the  operator  M  takes  the  familiar  vector  on  the  left-hand  side  is  closer  to  the  physics  of  linear 

Kaiser  (1987)  form,  a  diagonal  operator  in  Fourier  space  redshift  distortions.  Indeed,  inverting  equation  (17), 


M 


& 


(13) 


where  [ik  =  k-z  is  the  cosine  of  the  angle  between  the 
wavevector  k  and  the  line  of  sight  z.  Here,  however,  we  do  not 
assume  the  small-angle  approximation  but  rather  take  into 
account  the  full  radial  nature  of  redshift  distortions.  Radial 
redshift  distortions  destroy  translation  invariance,  so  the  radial 
redshift  distortion  operator  is  no  longer  diagonal  in  Fourier 
space,  as  it  is  in  the  small-angle  approximation;  indeed,  the 
radial  redshift  distortion  operator  takes  a  rather  complicated 
form  in  Fourier  space  (Hamilton  1998,  eq.  [4.37]).  The  radial 
redshift  distortion  operator  takes  a  simpler  form  in  real  space, 
where  M,  expressed  in  the  frame  of  the  Local  Group,  can  be 
written  as  the  integrodifferential  operator  (Hamilton  1998, 
eq.  [4.46]) 


M  = 


’  d2  a(r)r 

d 

y 

dr2  r 

\dr 

dr 

r=0/  . 

(14) 


with  a  ( r )  the  logarithmic  derivative  of  r2  times  the  selection 
function  h(r). 
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(18) 


we  see  that  we  can  use  this  last  equation  as  an  improved  defi¬ 
nition  of  monopole,  quadrupole,  and  hexadecapole,  remaining 
valid  even  in  the  regime  where  the  small-angle  approximation 
fails.  For  the  reader  more  used  to  thinking  in  terms  of  the 
multipole  formalism,  the  bottom  line  is  that  our  main  mea¬ 
surement  Pgg(k )  is  basically  the  monopole  power  minus  half  the 
quadrupole  power  plus  |  of  the  hexadecapole  power,  as  per 
equation  ( 1 7). 

Because  redshift  distortions  displace  galaxies  only  along 
the  line  of  sight,  the  transverse,  or  angular,  power  spectrum 
is  completely  unaffected  by  redshift  distortions,  a  point  em¬ 
phasized  by  Hamilton  &  Tegmark  (2002).  In  the  small-angle 
approximation,  the  galaxy-galaxy  power  spectrum  equals  the 
redshift-space  power  spectrum  in  the  transverse  direction, 

.  (_l)£/2(£  _  DM 

pgg(k) = ps(nk  =  o)  =  j2  e)A — -pm,  (i9) 

t  even 


a(r) 


d  In  [r2n(r)\ 
9  In  r 


(15) 


The  d/dr\r=0  term  inside  parentheses  in  equation  (14),  which 
subtracts  from  the  first  term  d/dr  its  value  at  the  position 
r  =  0  of  the  Local  Group,  is  the  term  that  arises  from  the 
motion  of  the  Local  Group.  The  Hermitian  conjugate  M', 
which  enters  equation  (12)  for  the  velocity  mode  Q,  can  be 
written  (Hamilton  1998,  eq.  [4.50]) 


2  9  1 

f  d  a  (r) 

V--- 

d 

dr  ' 

y  dr  r 

/  r2 

dr 

V  2a(r)r, 


r=0 


(16) 


in  which  the  last  term  is  again  the  term  arising  from  the  mo¬ 
tion  of  the  Local  Group. 

Although  the  definition  of  these  three  power  spectra 
assumes  that  redshift  distortions  conform  to  the  linear  Kaiser 
model,  they  measure  useful  information  from  the  data  even  if 
the  linear  model  fails.  In  the  small-angle  (distant  observer) 
approximation,  they  reduce  to  simple  linear  combinations  of 
the  monopole,  quadrupole,  and  hexadecapole  power  spectra  in 
redshift  space  (Cole  et  al.  1994;  Hamilton  1998): 
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(17) 


Whereas  the  vector  on  the  right-hand  side  is  closer  to  the 
measurements  (and  also  more  familiar  in  the  literature),  the 


which  is  true  in  all  circumstances,  linear  or  nonlinear,  re¬ 
gardless  of  the  character  of  redshift  distortions.  The  coef¬ 
ficients  of  the  expansion  (19)  are  the  values  of  Legendre 
polynomials  Ve(^k)  in  the  transverse  direction  fj,k  =  0.  The 
first  few  terms  of  the  expansion  (19)  are 

Pgg(k)  =  P*(k)  -  X-Ps2{k)  +  ^-P\(k)  -  ^ P%{k )  +  ..., 

(20) 

which  shows  that  our  linear  estimate  of  Pgg(k)  is  effectively 
the  expansion  (19)  truncated  at  the  i  —  4  harmonic,  as  pre¬ 
dicted  by  linear  theory  (Kaiser  1987).  We  expect  on  general 
grounds  that  the  linear  estimate  of  Pgg(k )  will  underestimate 
the  true  galaxy-galaxy  power  spectrum  at  nonlinear  scales 
(Hamilton  &  Tegmark  2002),  although  this  underestimate 
should  be  mitigated  by  FOG  compression. 

In  §  4,  we  demonstrate  with  Monte  Carlo  simulations  that 
Pgg  faithfully  recovers  the  true  real-space  galaxy  power 
spectrum  on  large  scales,  and  we  quantify  what  constitutes 
“large,”  finding  accurate  recovery  on  substantially  smaller 
scales  than  those  on  which  the  Kaiser  approximation  is  valid. 

A  wide  range  of  approximations  for  Pgv  and  Pvv  have  been 
introduced  in  the  literature.  By  using  our  notation,  the  Kaiser 
(1987)  approximation  becomes  simply 

Pgv(k)=mr(k)Pgg(k),  (21) 

pvv(k)=m2p9m,  (22) 

where  (3{k)  =  f  /b(k),  b{k )  is  the  bias  factor,  r{k)  is  the  di¬ 
mensionless  correlation  coefficient  between  the  galaxy  and 
matter  density  (Dekel  &  Lahav  1999;  Pen  1998;  Tegmark  & 
Peebles  1998),  and /  «  f 6  was  defined  above.  Since  both  b 
and  r  can  in  principle  depend  on  scale,  we  have  two  unknown 
functions  f3(k)  and  r(k)  that  can  in  principle  be  determined 
uniquely  from  the  two  measured  ratios  Pgv(k)/Pgg(k)  and 
Pm(k)/Pgg{k)  in  the  Kaiser  approximation.  Further  popular 
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approximations  in  the  literature  are  that  both  b  and  r  are 
constant,  and  most  workers  also  assume  r  =  1  despite  some 
observational  (Tegmark  &  Bromley  1999;  Blanton  2000)  and 
simulational  (Blanton  et  al.  1999;  Cen  &  Ostriker  2000; 
Somerville  et  al.  2001)  evidence  that  r  may  be  of  the  order  of 
0.9  for  some  galaxies.25 

We  do  not  make  any  of  these  approximations  in  our  basic 
data  analysis,  simply  reporting  measurements  of Pgg(k),Pgv(k), 
and  Pm(k )  from  the  SDSS  data.  In  §  4  we  use  the  approximation 
that  £  >  4  anisotropies  are  negligible,  assessing  its  accuracy 
with  Monte  Carlo  simulations  and  tunable  FOG  compression, 
but  the  reader  wishing  to  avoid  approximations  can  simply  fit 
better  simulations  directly  to  our  three  measured  curves.  Spe¬ 
cifically,  using  an  axis  of  a  periodic  cube  as  the  line-of-sight 
direction,  so  that  the  distant  observer  approximation  holds 
perfectly,  one  can  compute  the  monopole,  quadrupole,  and 
hexadecapole  components  of  the  redshift-space  power  spec¬ 
trum  and  transform  them  to  Pgg,  Pgv,  and  Pm,  via  equation  (17). 

3.6.  Step  3:  Power  Spectrum  Estimation 

All  our  information  about  the  SDSS  density  field  is  encoded 
in  the  4000-dimensional  vector  x  plotted  in  Figure  10,  and 
equation  (8)  shows  that  the  covariance  matrix  of  Jt:  depends 
linearly  on  the  three  power  spectra  that  we  want  to  measure. 
We  wish  to  invert  equations  (5)  and  (8)  to  estimate  the  power 
spectra  from  the  data  vector.  This  problem  is  mathematically 
equivalent  to  that  of  measuring  the  power  spectrum  from  a 
CMB  map  and  can  be  solved  optimally  with  so-called  qua¬ 
dratic  estimators  (Tegmark  1997b;  Bond,  Jaffe,  &  Knox 
2000).  We  describe  our  analysis  method  in  full  detail  in  Ap¬ 
pendix  B.  Flowever,  since  it  is  important  for  the  interpretation, 
let  us  briefly  review  here  how  the  measurements  are  computed 
from  the  input  data  x. 

We  parameterize  our  three  power  spectra  by  their  ampli¬ 
tudes  in  97  separate  logarithmically  spaced  &-bands  as  detailed 
in  Appendix  B,  so  our  goal  is  to  measure  3  x  97  =  291  band 
powers  pi,  i  =  1,  .  .  .  ,291.  Quadratic  estimators  p,  are  simply 
quadratic  functions  of  the  data  vector  x,  and  the  most  general 
unbiased  case  can  be  written  as 

Pi  =  x'QiX  -  tr  (NQi)  (23) 

for  some  symmetric  N  x  TV-dimensional  matrices  Qp,  the 
second  term  merely  subtracts  the  expected  contribution  from 
shot  noise. 

The  basic  idea  behind  quadratic  estimators  is  that  each 
matrix  Qt  can  be  chosen  to  effectively  Fourier  transform  the 
density  field,  square  the  Fourier  modes  in  the  ;th  power 
spectrum  band,  and  average  the  results  together,  thereby 
probing  the  power  spectrum  on  that  scale.  Grouping  the 
parameters  p,  and  the  estimators  pi  into  vectors  denoted  p  and 
p ,  the  expectation  value  and  covariance  of  the  measurements 
is  given  by 

(p)  =  Wp ,  (24) 

cov(p)  =  {pp‘)  -  {p)(py  =  S,  (25) 

25  Although  r  <  1  is  normally  referred  to  as  stochastic  bias,  this  does  of 
course  not  imply  any  randomness  in  the  galaxy  formation  process,  merely  that 
additional  factors  besides  the  present-day  dark  matter  density  may  be  im¬ 
portant  (gas  temperature,  say).  The  evidence  for  r  <  1  thus  far  comes  from 
scales  smaller  than  those  that  are  the  focus  of  this  paper.  More  details  on  the 
relationship  between  our  three  power  spectra  and  the  stochastic  bias  formalism 
are  given  in  §  3.4  of  THX02. 


where  the  matrices  W  and  S  can  be  computed  from  the  Q, 
matrices  and  the  geometry  alone  via  equations  (B3)  and  (B4) 
in  Appendix  B. 

As  detailed  in  Appendix  B,  there  are  several  attractive 
choices  of  Q  matrices,  each  giving  different  desirable  prop¬ 
erties  to  the  matrices  W  and  S.  Figure  13  shows  the  power 
spectrum  measurements  p  for  the  baseline  galaxy  sample  by 
using  the  choice  of  Q ,  that  gives  the  smallest  error  bars,  and 
Figure  14  shows  them  by  using  a  better  choice  described 
below. 

Although  Figure  13  looks  impressive,  it  fails  to  convey 
two  important  complications.  The  first  is  that  the  error  bars 
are  strongly  correlated  between  neighboring  bands;  i.e.,  the 
covariance  matrix  X  is  far  from  diagonal.  The  second  com¬ 
plication  involves  the  matrix  W,  known  as  the  window  matrix. 
The  Q  matrices  are  normalized  so  that  each  row  of  the  window 
matrix  sums  to  unity.  Equation  (24)  shows  that  this  normali¬ 
zation  enables  us  to  interpret  each  band  power  measurement 
pi  as  a  weighted  average  of  the  true  power  spectrum  pj,  the 
elements  of  the  zth  row  of  W  giving  the  weights  (the  so-called 
window  function).  In  short,  the  window  functions  connect  our 
measurements  p  to  the  underlying  power  spectrum  parameters 
p.  The  windows  are  plotted  in  Figure  15,  and  we  see  that  they 
are  complicated  in  two  different  ways,  making  Figure  13  hard 
to  interpret: 

1.  Smearing. — They  have  a  nonzero  width  AAr,  so  that  our 
estimate  of  the  power  on  some  scale  k  is  really  the  weighted 
average  of  the  power  over  a  range  of  scales  around  k.  In  other 
words,  Figure  13  shows  a  measurement  of  the  true  power 
spectrum  that  has  been  smoothed  and  convolved  with  rather 
broad  window  functions. 

2.  Leakage. — They  mix  the  gg,  gv,  and  vv  power  spectra,  so 
that  a  nominal  estimate  of  gv,  say,  is  really  a  weighted  average 
of  gg,  gv,  and  vv  power.  This  is  why  the  signal-to-noise  ratio  of 
gv  and  vv  appears  so  high  in  Figure  13. 

As  described  in  Appendix  B,  both  the  correlation  problem 
and  the  smearing  problem  can  be  tackled  in  one  fell  swoop 
with  a  better  choice  of  quadratic  estimators  that  give  uncor¬ 
related  error  bars  and  narrower  window  functions,  shown  in 
Figure  16,  and  the  leakage  problem  can  be  solved  as  shown 
in  Figure  17.  This  choice  makes  the  covariance  matrix  S  for 
the  measured  vector  p  diagonal  (combining  shot  noise  and 
sample  variance  errors),  so  it  is  completely  characterized  by 
its  diagonal  elements,  given  by  the  error  bars  in  Table  2  and 
Figure  14.  A  clearer  and  less  cluttered  view  of  a  sample  win¬ 
dow  function  for  this  uncorrelated  case  is  given  in  Figure  18 
( top  left).  We  see  that  such  a  window  is  almost  never  negative, 
and  tends  to  be  sharply  peaked  around  the  A'-value  that  it  is 
designed  to  probe.26 

Let  us  now  turn  to  the  remaining  problem:  leakage.  The 
leakage  results  from  a  combination  of  two  effects:  difficulties 
in  separating  the  monopole,  quadrupole,  and  hexadecapole 
power  given  the  complicated  survey  geometry  and  the  mixing 
of  these  three  multipoles  given  by  equation  (17).  Figures  15, 

26  Its  characteristic  width  A k  corresponds  roughly  to  the  inverse  width  of 
the  survey  volume  in  its  narrowest  direction  (Tegmark  1995),  so  the  windows 
will  get  narrower  as  the  SDSS  becomes  more  complete  and  the  thin  sky  slices 
seen  in  Fig.  1  thicken  and  merge.  Windows  farther  to  the  left  are  slightly 
narrower  (when  plotted  on  a  linear  &-scale  as  opposed  to  the  logarithmic  scale 
used  here),  since  they  probe  more  distant  galaxies  and  hence  a  larger  effective 
volume.  However,  since  our  sample  contains  very  few  galaxies  with  z  0.2, 
the  window  width  A k  approaches  a  constant  as  we  keep  moving  to  the  left  in 
Fig.  16,  causing  the  windows  to  look  wider  on  our  logarithmic  axis. 
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Fig.  13. — Minimum- variance  quadratic  estimators  pi  nominally  measuring 
(top)  the  galaxy-galaxy  power  spectrum,  (middle)  the  galaxy-velocity  power 
spectrum,  and  (bottom)  the  velocity-velocity  power  spectrum  for  the  baseline 
galaxy  sample.  They  cannot  be  directly  interpreted  as  power  spectrum 
measurements,  since  each  point  probes  a  linear  combination  of  all  three  power 
spectra  over  a  broad  range  of  scales,  typically  centered  at  a  A:-value  different 
than  the  nominal  k  where  it  is  plotted.  Moreover,  nearby  points  are  strongly 
correlated,  causing  this  plot  to  overrepresent  the  amount  of  information 
present  in  the  data.  The  solid  curves  show  the  window-convolved  prior  power 
spectrum  Wp,  and  the  dashed  curve  shows  the  shot-noise  contribution  sub¬ 
tracted  out.  [See  the  electronic  edition  of  the  Journal  for  a  color  version  of  this 
figure.] 


16,  and  17  show  the  zth  window  function  (the  7th  row  of  the  W 
matrix)  as  three  curves  plotted  in  the  top,  middle,  and  bottom 
panels,  giving  the  sensitivity  of  the  estimator  pj  to  gg,  gv,  and 
vv  power,  respectively.  If  there  were  no  leakage,  then  all 
curves  in  the  six  off-diagonal  panels  in  these  figures  would  be 
identically  zero.  This  is  not  the  case,  but  the  area  under  the 
curves  is  much  reduced  in  the  off-diagonal  panels,  as  is  shown 
by  comparing  the  leftmost  and  middle  panels  of  Figure  19. 
Switching  to  uncorrelated  quadratic  estimators  causes  a  sub¬ 
stantial  leakage  reduction  as  a  side  benefit,  but  that  leakage  is 
still  nonnegligible:  for  instance,  an  estimate  of  the  gg  power  is 
seen  to  give  about  15%  weight  to  Pgv  and  about  2%  weight  to 
Pvv,  with  these  percentages  depending  only  weakly  on  k. 


Fig.  14. — Decorrelated  and  disentangled  measurements  of  (top)  the  galaxy- 
galaxy  power  spectrum,  (middle)  the  galaxy-velocity  power  spectrum,  and 
(bottom)  the  velocity-velocity  power  spectrum  for  the  baseline  galaxy  sample. 
Each  point  is  plotted  at  the  lvalue  that  is  the  median  of  its  window  function, 
and  the  horizontal  bars  range  from  the  20th  to  the  80th  percentile  of  the 
window  function.  The  values  of  Pgg(k)  are  given  in  Table  3.  From  top  to 
bottom,  the  three  curves  show  our  prior  for  Pgg(k),  Pgv(k),  and  Pvv(k).  Note 
that  most  of  the  information  in  the  survey  is  in  the  galaxy-galaxy  spectrum. 
Band  power  measurements  with  very  low  information  content  have  been 
binned  into  fewer  (still  uncorrelated)  bands.  All  these  measurements  are  for 
our  baseline  FOG  compression  threshold  of  200.  Unlike  Fig.  13,  these  points 
are  uncorrelated  (affecting  all  three  panels),  and  the  leakage  between  gg,  gv, 
and  vv  has  been  removed  (affecting  mostly  the  lower  panels),  giving  much 
larger  (and  easier  to  interpret)  error  bars.  [See  the  electronic  edition  of  the 
Journal  for  a  color  version  of  this  figure .] 

As  detailed  in  Appendix  B,  we  can  eliminate  the  leakage 
problem  and  measure  one  power  spectrum,  say,  Pgg{k),  without 
any  assumptions  about  the  other  two  by  effectively  marginal¬ 
izing  over  their  amplitudes  separately  for  each  £-band.  This 
procedure  is  equivalent  to  yet  another  choice  of  the  Q  matrices, 
which  we  refer  to  as  “disentangled  .”  As  seen  in  Figure  17  and 
Figure  19  {right),  it  eliminates  leakage  completely  in  the  sense 
that  all  unwanted  (off-diagonal)  window  functions  have  zero 
area.  The  basic  idea  of  the  disentanglement  procedure  is 
illustrated  in  Figure  18:  since  the  gg,  gv,  and  vv  components 
of  the  window  function  have  very  similar  shape,  differing 
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Fig.  15. — By  using  the  correlated  minimum-variance  estimators,  the  window  functions  are  shown  for  those  /.-bands  with  nonnegligible  information  content. 
The  ith  row  of  W  typically  peaks  at  the  ith  band,  the  scale  k  that  the  band  power  estimator  //,  was  designed  to  probe.  The  three  rows  correspond  to  the 
estimators  of  gg,  gv,  and  vv  power,  and  the  three  columns,  to  their  sensitivity  to  gg,  gv,  and  w  power.  For  example,  the  window  function  of  the  quadratic 
estimator  targeting  Pgv(k )  at  k  —  0. 1  h  Mpc  1  is  given  by  the  three  curves  in  the  middle  column  peaking  at  k  =  0.1  h  Mpc  1  (top,  middle,  and  bottom  panels), 
and  the  normalization  is  such  that  the  sum  of  the  areas  under  these  three  curves  is  unity.  [ See  the  electronic  edition  of  the  Journal  for  a  color  version  of  this 
figure .] 


essentially  only  in  amplitude,  it  is  possible  to  form  linear 
combinations  of  them  that  for  all  practical  purposes  vanish. 
In  forming  these  linear  combinations,  we  do  introduce  sta¬ 
tistical  correlations  between  Pgg{k),  Pgv{k),  and  Pm{k)  at  a 
given  value  of  k\  the  values  at  different  values  of  k  remain 
uncorrelated. 

In  summary,  we  have  measured  the  three  power  spectra 
Pgg{k),  Pgv(k),  and  Pm,(k),  obtaining  the  results  shown  in 
Figure  14.  These  basic  measurements  are  given  in  Table  2.27 
The  measurements  make  no  assumptions  whatsoever  about 
redshift-space  distortions,  and  the  issue  of  whether  the  density 


27  Also  available  at  http://www.hep.upenn.edu/~max/sdss.html,  together 
with  their  window  matrix  and  likelihood  calculation  software,  incorporating 
the  bias  correction  described  in  §  5. 


fluctuations  are  Gaussian  affects  only  the  error  bars,  not  the 
measurements  themselves. 

In  the  next  section,  we  model  the  effect  of  redshift  dis¬ 
tortions  and  make  what  we  argue  is  a  more  accurate  estimate 
of  Pgg{k).  However,  the  conservative  reader  trusting  only 
her/his  own  modeling  can  in  principle  stop  right  here  and  fit 
simulations  directly  to  our  measurements  from  Figure  14, 
which  are  given  in  Table  2. 

4.  ACCOUNTING  FOR  REDSHIFT  SPACE  DISTORTIONS 

Thus  far,  we  have  measured  the  SDSS  galaxy  power  spec¬ 
trum  and  its  redshift-space  anisotropy,  encoded  in  the  three 
functions  Pgg{k),  Pgv(k),  and  Pm(k).  In  the  present  section,  we 
focus  on  this  anisotropy  to  model,  quantify,  and  correct  for  the 
effects  of  redshift-space  distortions,  producing  an  estimate  of 
the  true  real-space  galaxy  power  spectrum,  Pggtrue  ( k ).  We  use 
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Fig.  16. — Same  as  Fig.  15,  but  using  decorrelated  estimators  (before  disentanglement).  Comparison  with  Fig.  15  shows  that  decorrelation  makes  all  windows 
substantially  narrower.  [See  the  electronic  edition  of  the  Journal  for  a  color  version  of  this  figure.] 


Monte  Carlo  simulations  to  assess  the  accuracy  of  two  alter¬ 
native  approaches: 

1.  Disentanglement  approach. — Perform  FOG  compres¬ 
sion,  then  simply  use  Pgg{k)  from  Figure  14  as  the  estimator 
of  Rjjtrue  (k) . 

2.  Modeling  approach. — Perform  FOG  compression,  then 
make  the  Kaiser  approximation  of  equations  (21)  and  (22)  with 
the  best-fit  constant  values  of  (3  and  r  to  eliminate  Pgv{k)  and 
Pvv(k)  from  the  problem,  solving  for  the  97  decorrelated 
measurements  of  Pgg true  ( k )  (see  Appendix  B  and  THX02  for 
details). 

The  difference  between  the  two  approaches  is  essentially  be¬ 
tween  marginalizing  over  the  other  two  power  spectra  \Pgv(k) 
and  Pvv(k)\  and  modeling  them.  Both  approaches  break  down 
on  small  scales,  so  we  focus  on  quantifying  their  accuracy  as 
a  function  of  k.  We  see  that  although  the  disentanglement 
approach  is  more  robust  to  nonlinearities,  the  modeling  ap¬ 
proach  has  the  advantage  of  roughly  halving  the  error  bars, 
corresponding  to  quadrupling  the  Fisher  information.  The 


gain  comes  from  using  rather  than  discarding  measured  infor¬ 
mation  about  the  amplitudes  of  the  gv  and  vv  power  spectra.  We 
argue  that  the  disentanglement  method  is  overly  conservative, 
especially  on  extremely  large  scales  such  as  A:  <  0.05  h  Mpc  , 
where  we  need  all  the  statistical  power  that  we  can  get. 

There  are  two  separate  sources  of  statistical  bias  in  our 
measurement  that  we  quantify  below.  The  first  is  that  Pgg(k) 
equals  the  true  real-space  power  only  on  scales  on  which  the 
Kaiser  approximation  holds,  generally  underestimating  it  on 
smaller  scales.  The  second  occurs  only  in  the  modeling  ap¬ 
proach,  which  produces  a  biased  measurement  of  Pgg{k )  if  the 
model  parameters  (3  and  r  are  incorrect.  Let  us  begin  our 
investigation  by  studying  the  real  data,  then  turn  to  Monte 
Carlo  simulations  to  better  understand  and  quantify  the  results. 

4.1.  Results  Based  on  the  Data 

Figure  14  shows  that  whereas  we  have  precise  measure¬ 
ments  of  Pgg(k),  we  have  rather  limited  information  about 
Pgv(k)  and  close  to  no  information  about  Pvv(k).  Figure  20 
shows  a  slightly  less  noisy  rendition  of  the  same  information. 
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Fig.  17. — Same  as  Fig.  16,  but  decorrelated  and  disentangled  estimators.  Comparison  with  Fig.  16  shows  that  disentanglement  gives  curves  in  the  off-diagonal 
panels  a  vanishing  average  and  almost  completely  eliminates  leakage  of  gv  and  vv  power  into  estimators  of  gg  power  (left  middle  and  left  bottom).  [See  the 
electronic  edition  of  the  Journal  for  a  color  version  of  this  figure.] 


Here  we  have  taken  all  three  curves  to  have  the  shape  of  the 
prior  power  spectrum  and  plotted  their  best-fit  amplitudes 
relative  to  the  prior.  These  fits  are  performed  cumulatively, 
using  all  measurements  for  all  wavenumbers  <k.  The  three 
bands  give  the  1  a  allowed  ranges  for  gg,  gv,  and  vv,  re¬ 
spectively.  It  is  well  known  that  as  k  increases,  nonlinearities 
become  more  important  and  start  reducing  gv,  eventually 
driving  it  to  negative  values.  This  is  because  the  FOG  effect 
has  the  opposite  sign  of  the  linear  Kaiser  infall,  causing  less 
rather  than  more  radial  power  (or  larger  rather  than  smaller 
radial  correlations,  for  the  reader  preferring  real  space  over 
Fourier  space).  The  fact  that  Figure  20  does  not  show  this 
effect  is  therefore  a  first  encouraging  indication  that  non- 
linearities  have  only  a  minor  impact  on  our  results  over  the 
range  of  scales  that  we  consider.  Since  we  have  used  only 
4000  PKL  modes,  most  of  the  information  from  scales  k  3> 
0.1  h  Mpc  is  excluded  from  our  analysis  (cf.  Fig.  12).  The 
bands  in  the  figure  therefore  stop  getting  thinner  for 
A' »  0.1  h  Mpc-1.  In  other  words,  the  information  contained 
in  our  data  vector  jr  describes  only  a  highly  smoothed  version 
of  the  density  field,  where  nonlinear  effects  are  small. 


The  five  thin  lines  in  Figure  20  correspond  to  our  five  FOG 
compression  thresholds  and  show  several  noteworthy  things. 
First  of  all,  changing  the  FOG  threshold  is  seen  to  have  a 
strong  effect  on  gv  but  almost  no  effect  on  gg  (the  quantity  that 
we  really  care  about  in  this  paper),  providing  another  en¬ 
couraging  indication  that  virialized  structures  do  not  pose  an 
unsurmountable  problem  for  us.  Second,  more  aggressive 
FOG  removal  is  seen  to  raise  the  gv  amplitude.  This  is  the 
expected  sign  of  the  effect,  since  it  removes  (and  eventually 
overcorrects  for)  the  FOG  effect,  which  suppresses  gv.  Third, 
the  gg  and  gv  curve  pentuplets  are  seen  to  diverge  as  k 
increases,  as  nonlinearities  become  more  important.  For 
gv,  the  spread  between  the  baseline  threshold  1  +  Sc  =  200 
and  the  rather  extreme  neighboring  thresholds  (100  and  oo) 
equals  the  error  bar  for  k  ~  0.3  h  Mpc-1,  suggesting  that 
nonlinearity-related  uncertainties  become  comparable  to  sta¬ 
tistical  uncertainties  on  this  scale  when  trying  to  measure  the 
redshift  distortion  parameter  3.  For  gg,  on  the  other  hand,  the 
statistical  uncertainties  dominate  on  all  scales  to  which  we  are 
sensitive.  The  optimal  FOG  compression  threshold  should  be 
expected  to  lie  somewhere  between  our  options  200  and  oo, 
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TABLE  2 

Real-Space  Power  Spectrum  with  the  Disentanglement  Method 


k 

(h  Mpc-1) 

p 

(ft-1  Mpc)3 

P(_LFOG) 
(ft-1  Mpc)3 

A  P 

(ft-1  Mpc)3 

b 

pgpp) 

0.018+nS . 

42098 

41081 

28850 

1.167 

0.023+nUS? . 

28260 

28924 

16394 

1.167 

0.028+HS . 

20880 

20508 

15849 

1.166 

0.032+^1? . 

16903 

17097 

12079 

1.165 

0.037+ooci7 . 

12178 

12119 

9004 

1.164 

0.042+“'“I3 . 

11887 

11996 

6944 

1.163 

0.049+““™ . 

13098 

13094 

5188 

1.161 

0.056+“:“|j . 

13996 

14003 

3847 

1.159 

0.065+Hij . 

10273 

10333 

2847 

1.157 

0.075+nn[I . 

6296 

6366 

2130 

1.153 

0.086+!!!!}? . 

9653 

9687 

1594 

1.149 

o.ioo+“:“j‘ . 

5763 

5814 

1205 

1.144 

0.1 . 

6229 

6273 

921 

1.139 

0.132+HS . 

4693 

4711 

712 

1.132 

0.153+S'^I . 

3263 

3321 

554 

1.123 

0.176+““^ . 

3778 

3811 

437 

1.114 

0.202+2'^ . 

2423 

2428 

356 

1.104 

0.232+^S . 

1891 

1892 

312 

1.093 

0.264+ij'iB! . 

952 

947 

304 

1.082 

0.290+HS . 

1340 

1385 

386 

1.074 

Pgv{k) 

0.022+“““5 . 

52115 

51536 

28798 

1.167 

0.039+!!:!!S . 

17843 

17716 

10870 

1.164 

0.069+“:“‘‘ . 

5451 

5233 

4057 

1.155 

0.120+^i . 

2991 

2746 

1684 

1.137 

0.21  l+n'(u7 . 

1207 

902 

684 

1.101 

0.291+““’ . 

537 

319 

720 

1.074 

Pm(k) 

0.067+nm4 . 

3700 

3583 

7712 

1.156 

0.278+Hjg . 

7 

-81 

1185 

1.078 

Note. — A P  is  the  1  a  error.  These  values  have  been  corrected  for  luminosity- 
dependent  bias  by  dividing  by  the  square  of  the  last  column  and  thus  refer  to  the 
clustering  of  Z,*  galaxies.  The  A>column  gives  the  median  of  the  window  function  and 
its  20th  and  80th  percentiles;  the  exact  window  functions  from  http://www.hep. 
upenn.edu/~max/sdss.html  should  be  used  for  any  quantitative  analysis.  The  Pgg 
errors  are  uncorrelated  with  one  another  but  are  correlated  with  the  Pgv  and  Pvv  errors. 
We  recommend  using  col.  (2)  for  basic  analysis.  Col.  (3)  is  without  FOG  removal 
(i.e.,  with  threshold  6C  —  oo)  and  is  therefore  easier  to  compare  against  numerical 

simulations. 


since  1+  6C  =  200  is  the  approximate  overdensity  of  a  cluster 
that  has  just  formed,  and  many  FOG  systems  will  have  formed 
earlier  and  hence  have  higher  overdensities.  The  other 
thresholds  plotted,  i.e.,  100,  50,  and  25,  are  thus  more  extreme 
and  eventually  unphysical — we  have  used  and  plotted  them 
merely  to  exaggerate  and  illustrate  the  effect  of  FOG  removal 
more  clearly. 

Since  vv  is  so  noisy,  our  main  constraint  on  redshift-space 
distortion  comes  from  the  ratio  of  gv  to  gg  power,  i.e.,  on 
r/3  =  Pgg/Pgv.  Figure  21  shows  our  1  a  constraints  on  r(3  as  a 
function  of  the  maximal  A'-band  included  in  a  cumulative  lit, 
discarding  the  vv  information  to  be  conservative.  The  effect  of 
FOG  removal  is  seen  to  be  smaller  than  the  statistical  errors 
for  all  scales  that  we  consider.  Our  (loose)  constraints  agree 
well  with  a  previous  /3-measurement  from  earlier  SDSS  data 
(Zehavi  et  al.  2002)  and  also  with  measurements  from  the 


2dFGRS  (Peacock  et  al.  2001 ;  THX02),  assuming  that  the  bias 
does  not  differ  dramatically  between  the  r-band-selected 
SDSS  galaxies  and  R-band-selected  2dF  galaxies.  We  are 
unable  to  break  their  near  degeneracy  and  place  strong  con¬ 
straints  on  P  and  r  separately,  but  a  joint  likelihood  analysis 
marginally  favors  r  ~  1 . 

Our  estimate  of  the  real-space  galaxy  power  spectrum  from 
the  disentanglement  approach  is  simply  Figure  14  (top).  The 
corresponding  estimate  using  the  model  approach  is  shown  in 
Figures  22  and  23;  the  values  are  tabulated  in  Table  3.  Here 
we  use  P  =  0.5  and  r  =  1,  which  provides  a  good  fit  to  our 
data.  The  measurements  are  also  tabulated  in  Tables  2  and  3. 
Changing  these  two  parameters  within  their  measurement 
uncertainty  causes  an  uncertainty  of  4%  in  the  overall  nor¬ 
malization  of  the  recovered  gg  power  spectrum  (which  is,  of 
course,  degenerate  with  a  2%  change  in  the  galaxy  bias).  The 
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Fig.  18. — Window  function  for  our  measurement  of  the  49th  band  of  the  galaxy-galaxy  power  (left)  before  and  (right)  after  disentanglement.  Whereas  unwanted 
leakage  of  gv  and  w  power  is  present  initially,  these  unwanted  window  functions  both  average  to  zero  afterward.  The  success  of  this  method  hinges  on  the  fact  that 
since  the  three  initial  functions  (left)  have  similar  shape,  it  is  possible  to  take  linear  combinations  of  them  that  almost  vanish  (right).  This  procedure  is  repeated 
separately  for  each  k. 
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corresponding  window  functions  are  shown  in  Figure  24; 
compare  with  Figure  17. 

To  indicate  how  linear  the  fluctuations  are  on  various  scales, 
Figure  23  shows  the  square  root  of  the  corresponding  dimen¬ 
sionless  power  spectrum,  which  can  be  crudely  interpreted  as 
the  rms  fluctuation  on  that  scale.  This  fluctuation  level  is  seen  to 
drop  below  10%  on  the  largest  scales,  k  ^  0.02  h  Mpc'1,  with 
the  curve  being  strikingly  different  from  a  power  law  (more 
clearly  seen  in  Fig.  22). 28  The  nonlinearity  transition  A  ~  1 
is  seen  to  occur  around  k  ~  0.2  h  Mpc-1,  but  this  is  a  crude 
estimate  since  what  matters  is  of  course  the  fluctuation  level  of 
the  matter,  not  of  the  galaxies,  and  the  two  differ  by  the  bias 
factor.  As  detailed  in  §  7,  ourZ*  galaxies  have  cr%  rts  0.93,  so  if 
as  «  0.8  for  the  matter  as  indicated  by  many  recent  CMB, 
lensing,  and  cluster  studies  (Lahav  et  al.  2002;  Wang  et  al. 
2003;  Bennett  et  al.  2003),  the  fluctuations  are  slightly  more 
linear  than  Figure  23  indicates. 

To  quantify  the  FOG  effect  on  our  recovered  real-space 
power  spectrum,  Figure  25  shows  the  ratio  of  the  measured 
power  spectrum  amplitude  to  its  value  with  our  baseline  FOG 

28  To  make  this  more  quantitative,  we  fit  the  measurements  to  a  parabola  in 
(log  k.  log  P),  obtaining  a  curvature  d  log  P/d  log  k  =  —1.28  ±  0.49.  For  a 
Markov  chain  with  106  models,  99.9%  had  a  <  0,  thereby  driving  yet  another 
nail  into  the  coffin  of  the  fractal  universe  hypothesis  and  any  other  models 
predicting  a  power-law  power  spectrum  (a  =  0). 


compression.  Just  as  we  saw  in  Figure  20,  nonlinearities  be¬ 
come  progressively  more  important  toward  smaller  scales.  Quan¬ 
titatively,  the  disentanglement  method  is  seen  to  be  almost 
unaffected  by  FOG  compression.  Over  the  range  0.1  h  Mpc  1  < 
k  <  0.2  h  Mpc-1  where  the  error  bars  are  smallest,  changing  the 
FOG  compression  threshold  within  the  rather  extreme  range 
lOO-oo  changes  the  measured  fluctuation  amplitude  by  only 
about  1  %,  which  should  be  compared  with  statistical  error  bars 
of  8%  or  more.  The  sensitivity  of  the  modeling  method  to  these 
nonlinear  effects  is  slightly  greater:  1%  at  k  ~  0. 1  h  Mpc-1  and 
4%  at  k  ~  0.2  h  Mpc-1,  again  letting  the  FOG  threshold  vary 
across  the  rather  extreme  range  lOO-oo. 

4.2.  Results  from  Monte  Carlo  Simulations 

We  need  to  quantify  how  accurately  what  we  measure, 
Pgg(k),  recovers  what  we  really  care  about,  i.e.,  the  real-space 
matter  power  spectrum  P(k).  Nonlinear  clustering  per  se 
would  not  bias  quadratic  estimators  of  the  power  spectrum, 
but  how  much  do  nonlinearities  in  the  redshift-space  dis¬ 
tortions  affect  the  results?  Figure  25  shows  that  the  sensitivity 
of  the  Pgg(k)  measurement  to  FOG  nonlinearities  is  around 
1%  at  £~0.1  h  Mpc-1,  i.e.,  negligible  compared  with  our 
statistical  measurement  errors.  Although  FOGs  are  perhaps  the 
most  important  way  in  which  nonlinear  redshift  distortions 
manifest  themselves,  mildly  nonlinear  effects  on  larger  scales 
are  also  important  (Scoccimarro  et  al.  2001;  R.  Scoccimarro 
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Fig.  19. — Leakage  is  shown  for  the  (left)  correlated,  (middle)  decorrelated,  and  (right)  disentangled  methods.  From  top  to  bottom  within  each  box,  the  three  bands 
(dark  gray,  gray,  and  light  gray)  show  the  relative  contributions  from  gg,  gv,  and  vv  power,  respectively.  From  top  to  bottom,  the  three  rows  are  for  estimators  of  gg, 
gv,  and  vv  power,  respectively.  [See  the  electronic  edition  of  the  Journal  for  a  color  version  of  this  figure .] 
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2004,  in  preparation).  To  be  prudent,  we  therefore  comple¬ 
ment  the  above-mentioned  tests  with  a  Monte  Carlo  analysis 
in  which  the  true  P(k)  is  known  and  we  can  directly  determine 
how  well  we  recover  it. 

We  use  two  suites  of  Monte  Carlo  simulations  as  summa¬ 
rized  in  Table  1.  The  first  consists  of  275  simulations  con¬ 
structed  with  the  PTHalos  code  (Scoccimarro  &  Sheth  2002), 
covering  1395  deg2  with  an  angular  completeness  map 
corresponding  to  the  northern  part  of  SDSS  sample9,  an  ear¬ 
lier  version  of  sample  11  discussed  in  Appendix  A.  In  short, 
this  code  is  a  fast  approximate  method  to  build  non-Gaussian 
density  fields  with  the  halo  model.  It  produces  realistic  cor¬ 
relation  functions  and  includes  nontrivial  galaxy  biasing  by 
placing  galaxies  within  dark  matter  halos  with  a  prescribed 
halo  occupation  number  as  a  function  of  halo  mass.  The 
second  suite  of  simulated  surveys  is  based  on  the  Hubble 
volume  ACDM  n-body  simulation  (Frenk  et  al.  2000;  Evrard 
et  al.  2002).  Ten  mock  surveys  were  produced  by  sparse- 
sampling  different  parts  of  the  simulation  cube  to  reproduce 
the  three-dimensional  selection  function  n(r)  for  SDSS 
sample8,  so  although  these  mock  surveys  include  fully  non¬ 


linear  gravitational  clustering,  they  have  trivial  light-to-mass  bias 
with  b  =  r  =  1  (the  “galaxies”  are  simply  a  random  subset  of 
the  dark  matter  particles). 

Figure  26  shows  that  the  average  Pgg(k )  recovered  by  using 
the  methodology  described  in  this  paper  from  the  PTHalos 
simulations  agrees  with  the  matter  P(k)  on  all  relevant  scales 
to  within  the  sensitivity  we  can  test,  as  expected  given  the 
above  indications  that  the  effect  of  nonlinearities  on  Pgg  is 
small.  It  also  confirms  that  the  analysis  pipeline  produces 
unbiased  results  (this  was  also  demonstrated  with  extensive 
Monte  Carlo  simulations  in  THX02).  The  mock  surveys  based 
on  the  Hubble  volume  simulation  give  similar  agreement,  al¬ 
though  with  larger  noise  since  there  are  only  10  of  them. 

Since  the  possible  biases  that  we  wish  to  quantify  are  so 
small  (at  the  percent  level),  it  is  desirable  to  have  still  more 
statistical  testing  power  than  these  numerical  experiments 
provide.  In  particular,  we  wish  to  test  the  breakdown  of  the 
Kaiser  approximation  as  a  function  of  scale;  here  we  are  not 
concerned  with  the  effects  of  the  survey  geometry.  For  the  275 
PTHalos  simulations,  we  therefore  measure  the  various  power 
spectra  by  using  all  of  the  roughly  106  galaxies  in  each  of  the 
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Fig.  20. — Gray  bands  show  the  1  a  allowed  ranges  for  the  amplitude  of  the 
gg,  gv,  and  vv  power,  respectively,  relative  to  the  prior  gg  spectrum,  as  a 
function  of  the  maximum  wavenumber  included  in  the  fit.  The  sets  of  five 
black  curves  show  the  best-fit  values  using  FOG  compression  with  the  five 
density  thresholds  1  +  6C  —  oo  (no  FOG  compression),  200  (our  baseline; 
heavy  curve),  100,  50,  and  25  (successive  curves  go  higher  for  gv  and  lower 
for  gg  and  w).  [See  the  electronic  edition  of  the  Journal  for  a  color  version  of 
this  figure .] 

full  simulation  cubes.  [Since  h(r)  is  now  constant  and  the 
boundary  conditions  are  periodic,  we  do  this  by  simply  using 
fast  Fourier  transforms,  matching  to  the  Kaiser  limit  as  k  — *  0 
to  reduce  sample  variance;  see  Scoccimarro  &  Sheth  2002.] 
No  FOG  correction  was  applied  to  these  simulations. 

The  results  are  shown  in  Figure  27.  The  upper  panel  (on  a 
logarithmic  scale)  shows  the  input  power  spectrum,  the  quan¬ 
tities  Pgg,  Pgv,  and  Pm,  as  well  as  the  monopole,  quadrupole,  and 
hexadecapole  Pq,  Ps2,  and  P\  as  dotted  lines.  The  lower  panel 
shows  (on  a  linear  scale)  the  ratio  of  each  of  these  quantities  to 
the  input  power  spectrum.  In  the  absence  of  nonlinear  clustering 
and  bias,  each  of  these  lines  would  be  horizontal.  We  see  that 
Pgg(k )  agrees  well  with  the  real-space  matter  power  spectrum 
P(k')  on  large  scales  and  progressively  underestimates  it  more 
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Fig.  21. — Gray  band  shows  the  1  a  allowed  range  for  r(3,  assuming  the 
shape  of  the  prior  Pgg{k)  but  marginalizing  over  the  power  spectrum  nor¬ 
malization,  using  FOG  compression  with  our  baseline  density  threshold 
1  +  8C  =  200.  From  bottom  to  top,  the  three  curves  show  the  best-fit  /3  for 
FOG  thresholds  1  +  8C  —  oo  (no  FOG  compression),  200  (our  baseline;  heavy 
curve),  and  100.  [See  the  electronic  edition  of  the  Journal  for  a  color  version 
of  this  figure .] 
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Fig.  22. — Decorrelated  real-space  galaxy-galaxy  power  spectrum  using  the 
modeling  method  {bottom)  for  the  baseline  galaxy  sample  assuming  (3  —  0.5 
and  r  —  1 .  As  discussed  in  the  text,  uncertainty  in  (3  and  r  contribute  to  an 
overall  calibration  uncertainty  of  order  4%,  which  is  not  included  in  these 
error  bars.  To  remove  scale-dependent  bias  caused  by  luminosity-dependent 
clustering,  the  measurements  have  been  divided  by  the  square  of  the  curve  in 
the  top  panel,  which  shows  the  bias  relative  to  Z,*  galaxies.  This  means  that  the 
points  in  the  bottom  panel  can  be  interpreted  as  the  power  spectrum  of  Z,* 
galaxies.  The  solid  curve  {bottom)  is  the  best-fit  linear  ACDM  model  of  §  5. 
[See  the  electronic  edition  of  the  Journal  for  a  color  version  of  this  figure .] 
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Fig.  23. — The  rms  density  fluctuation  amplitude  A (k)1^2  derived  from  the 
modeling  method  power  spectrum  of  Fig.  22,  where  A(&)  =  4nPgg(k)  [A:/(27r)]3 
and  k  is  the  effective  A:- value  from  Table  2. 


TABLE  3 

Real-Space  Power  Spectrum  with  the  Modeling  Method 


k 

(h  Mpc-1) 

P 

1  99 

{hr1  Mpc)3 

g2 

o, 

<1 

b 

0.016+222!; . 

21573 

33320 

1.168 

0.018+2:Sf . 

33255 

24573 

1.167 

0.021+2221 . 

13846 

17712 

1.167 

0.024+2221 . 

38361 

13320 

1.167 

0.028+2225 . 

24143 

10047 

1.166 

0.032+2225 . 

19709 

7414 

1.165 

0.037+2222 . 

12596 

5486 

1.164 

0.043+2:2I2 . 

13559 

4078 

1.163 

0.049+"“'' . 

18311 

2974 

1.161 

0.057+2212 . 

12081 

2140 

1.159 

0.065+2'2in . 

9217 

1580 

1.156 

0.075+22!? . 

9751 

1128 

1.153 

0.087+221? . 

9530 

818 

1.149 

0.100+221! . 

6385 

602 

1.144 

0.116+22?1 . 

5295 

447 

1.138 

o.  134+222! . 

4630 

335 

1.131 

0.154+SSS7 . 

3574 

254 

1.123 

0.178+"“' . 

3394 

195 

1.114 

0.205+22?? . 

2298 

153 

1.103 

0.236+22U . 

1597 

124 

1.092 

0.271+22S . 

1105 

107 

1.080 

0.306+":"+; . 

1013 

110 

1.069 

Note. — A Pgg  is  the  1  a  error,  uncorrelated  between  bands.  These 
values  have  been  corrected  for  luminosity-dependent  bias  by  dividing 
by  the  square  of  the  last  column  (see  §  5)  and  thus  refer  to  the  clus¬ 
tering  of  L*  galaxies.  The  ^-column  gives  the  median  of  the  window 
function  and  its  20th  and  80th  percentiles;  the  exact  window  functions 
from  http://www.hep.upenn.edu/  ~  max/sdss.html  should  be  used  for 
any  quantitative  analysis. 


Fig.  24. — SDSS  window  functions  (rows  of  W;  top )  using  decorrelated 
estimators  for  the  modeling  method.  The  zth  row  of  W  typically  peaks  at  the 
zth  band,  the  scale  k  that  the  band  power  estimator  pi  was  designed  to  probe. 
All  curves  have  been  normalized  to  have  the  same  area,  so  the  highest  peaks 
indicate  the  scales  where  the  window  functions  are  narrowest.  The  turnover  in 
the  envelope  at  k  ~  0.1  h  Mpc-1  reflects  our  running  out  of  information  be¬ 
cause  of  the  omission  of  modes  probing  smaller  scales.  The  32  2dFGRS 
window  functions  estimated  by  Percival  et  al.  (2001)  are  shown  for  compar¬ 
ison,  plotted  with  the  exact  same  conventions.  They  correspond  to  correlated 
rather  than  uncorrelated  measurements.  Their  shapes  and  widths  seem  to  agree 
well  with  the  SDSS  windows  around  k  ~  0.1  h  Mpc-1,  becoming  substan¬ 
tially  wider  on  larger  scales;  this  is  a  key  advantage  of  our  analysis  method. 
[See  the  electronic  edition  of  the  Journal  for  a  color  version  of  this  figure .] 


and  more  as  k  increases.  Quantitatively,  it  is  off  by  4%  at  k  ~ 
0.1  h  Mpc-1  and6%atk  ~  0.2  h  Mpc-1,  corresponding  to  2% 
and  3%  in  fluctuation  amplitude,  respectively.  These  numbers 
are  thus  in  the  same  ballpark  as  those  we  found  from  varying  the 
FOG  compression  threshold  above. 

While  the  agreement  is  impressive,  we  note  that  PTHalos 
code  may  not  have  a  fully  accurate  radial  distribution  of 
galaxies  inside  halos,  nor  is  the  halo  occupation  number  as  a 
function  of  mass  uniquely  determined  from  the  observations. 
For  these  reasons  one  should  exercise  caution  when  using 
these  results  in  the  nonlinear  regime  (k  S;  0.2  h  Mpc-1), 
bearing  in  mind  that  different  galaxy  distribution  models  may 
lead  to  larger  differences  between  the  nonlinear  matter  power 
spectrum  and  Psg. 

When  Figure  27  is  compared  with  Figures  14  and  20,  it  is 
striking  that  the  simulations  display  stronger  nonlinearity  than 
the  real  data.  The  simulations  show  Pgv(k)  going  negative  for 
k  S;  0.14  h  Mpc-1,  whereas  the  data  show  no  statistically 
significant  detection  of  negative  Pgv  power  on  any  scale 
probed.  This  difference  reflects  the  fact  that  the  small-scale 
velocity  dispersion  in  the  PTHalos  simulations  is  larger  than 
those  actually  observed.  In  other  words,  our  PTHalos  results 
should  not  be  interpreted  as  our  best  estimate  of  how  large 
the  nonlinear  problems  are  but  rather  more  as  a  worst-case 
scenario  for  the  importance  of  nonlinear  corrections. 

In  the  Kaiser  approximation,  all  curves  in  Figure  27  ( bot¬ 
tom )  would  be  horizontal  lines.  It  is  noteworthy  that  although 
the  strong  nonlinearities  in  the  simulations  cause  the  Kaiser 
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Fig.  25. — Effect  of  FOG  compression  on  the  measured  power  spectrum. 
From  bottom  to  top,  the  three  solid  curves  show  the  factor  by  which  the 
measured  fluctuation  amplitude  Pgg(k )1^2  is  increased  by  FOG  compression 
with  overdensity  thresholds  oo  (no  compression),  200  (our  baseline;  hori¬ 
zontal  line),  and  100,  respectively,  using  the  disentanglement  method.  The 
dashed  curves  show  the  same  ratios  for  the  modeling  method.  To  place  these 
effects  in  context,  the  relative  1  a  measurement  errors  on  the  power  spectrum 
amplitude  are  indicated  by  the  light  gray  band  for  the  disentanglement  method 
and  by  the  gray  band  for  the  modeling  method.  [See  the  electronic  edition  of 
the  Journal  for  a  color  version  of  this  figure .] 


k  [h  Mpc-1] 

Fig.  26. — Squares  show  the  mean  power  spectrum  Pgg(k)  recovered  from 
62  PTHalos  simulations  by  our  analysis  pipeline,  by  using  the  disentangle¬ 
ment  method  without  FOG  compression.  If  these  squares  faithfully  measure 
the  average  real-space  matter  power  spectrum  of  the  simulations  (solid  line), 
then  about  68%  of  them  should  lie  in  the  shaded  band  around  this  curve, 
whose  width  is  given  by  the  1  a  errors  computed  by  our  pipeline  divided  by 
the  square  root  of  the  number  of  simulations.  For  comparison,  the  dotted  curve 
shows  the  linear  power  spectrum  used  as  input  for  the  simulations.  [ See  the 
electronic  edition  of  the  Journal  for  a  color  version  of  this  figure .] 


k  [h  Mpc-1] 

Fig.  27. — Power  spectra  from  275  PTHalos  mock  catalogs,  quantifying 
how  accurately  our  Pgg(k)  statistic  recovers  the  true  power  spectrum.  The 
thick  black  curve  in  the  upper  panel  shows  the  real-space  power  spectrum  that 
we  wish  to  recover,  on  a  logarithmic  scale.  The  lower  panel,  on  a  linear  scale, 
shows  the  same  curves  as  the  upper  panel,  but  divided  by  this  reference  model. 
The  thin  solid  black  curve  is  the  linear  power  spectrum  that  was  taken  as  input 
for  the  simulations.  The  thick  dark  gray,  gray,  and  light  gray  curves  show  the 
recovered  gg,  gv,  and  w  power  spectra,  and  the  three  dotted  curves  show  the 
redshift-space  monopole,  quadrupole,  and  hexadecapole  power  from  top  to 
bottom  on  the  left-hand  side  (see  eq.  [17]),  all  curves  being  averages  from  275 
simulations  using  about  106  galaxies  each  in  the  full  simulation  cube.  Where 
they  are  negative,  the  gv  and  quadrupole  curves  are  plotted  as  positive  and 
dashed  in  the  upper  panel.  The  velocity  dispersion  is  higher  than  in  the  real 
SDSS  data,  and  no  FOG  compression  has  been  performed,  so  this  should  be 
viewed  as  a  worst  case  scenario.  Nonetheless,  the  figure  shows  that  the  Pgg(k) 
(dark  gray)  recovers  the  true  power  spectrum  (thick  black  curve)  to  within  a 
few  percent  at  k  —  0.1  h  Mpc-1  even  though  strong  departures  from  the 
Kaiser  approximation  (which  predicts  that  all  curves  are  horizontal  in  the 
lower  panel)  are  evident  in  the  Pgv{k)  curve  (thick  gray  curve)  on  these  scales. 
The  reason  that  our  method  works  so  well  is  that  Pgg(k)  recovers  the  trans¬ 
verse  power  (which  is  unaffected  by  redshift  distortions)  beyond  the  Kaiser 
approximation,  requiring  merely  that  i  >  6  anisotropies  be  negligible.  [See  the 
electronic  edition  of  the  Journal  for  a  color  version  of  this  figure .] 

approximation  for  P’'0(k')  and  Ps2(k)  (Fig.  27,  dotted  lines )  to 
break  down  on  very  large  scales,  k^  0.02  h  Mpc-1,  the 
combination  that  represents  Pgg(k)  remains  an  accurate  esti¬ 
mate  of  P^e(k)  down  to  much  smaller  scales.  We  obtain 
similar  results  by  using  the  analytic  halo  model  approach  of 
Seljak  (2001)  in  place  of  our  simulations.  R.  Scoccimarro 
(2004,  in  preparation)  shows  that  this  is  in  fact  a  generic  re¬ 
sult:  as  long  as  the  wavenumber  k  times  the  rms  pairwise 
velocity  dispersion  is  smaller  than  the  Flubble  parameter 
H,  Pl™e{k)  is  accurately  approximated  by  equation  (17)  even  if 
the  coefficients  in  this  expansion  are  not  well  approximated 
by  the  Kaiser  formula.29  This  can  be  intuitively  understood 
from  the  fact  that  Pgg(k)  is  equal  to  transverse  power  under  all 
circumstances,  linear  or  nonlinear,  as  exploited  in  Flamilton  & 
Tegmark  (2002).  As  long  as  redshift  distortions  can  be  rea¬ 
sonably  approximated  by  quadrupole  and  hexadecapole  dis¬ 
tortions,  then  the  arbitrary  functions  Pgv(k)  and  Pmi(k)  contain 

29  Unfortunately,  this  useful  result  does  not  hold  for  Pqh(k)  or  P„(k),  so 
these  two  functions  cannot  be  interpreted  as  simply  the  galaxy-velocity  and 
velocity-velocity  power  when  nonlinearities  are  important. 
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enough  freedom  to  model  distortions  completely,  even  if  they 
do  not  conform  to  the  Kaiser  model. 

A  third  and  final  piece  of  evidence  that  nonlinearities  have 
no  major  effect  on  our  measurement  of  the  large-scale  real- 
space  power  comes  from  a  direct  comparison  of  Pgg(k)  re¬ 
covered  with  our  disentanglement  and  modeling  methods. 
Although  the  former  has  about  twice  as  much  scatter  as  the 
latter,  the  two  measurements  show  excellent  agreement.  There 
is  no  hint  of  systematic  differences  between  the  two  on  any 
scale. 

The  bottom  line  of  this  section  is  that  although  estimates  of 
the  redshift-space  distortions  (estimates  of  /?,  the  gv/gg  ratio, 
the  quadrupole-to-monopole  ratio,  etc.)  are  very  sensitive  to 
nonlinear  effects,  our  estimates  of  the  real-space  matter  power 
are  not.  We  have  argued  that  any  scale-dependent  statistical 
bias  in  our  Pgg{k)  results  due  to  nonlinear  redshift  distortions 
(or  errors  in  our  code)  is  smaller  than  a  few  percent  for 
k  <  0.1  h  Mpc~',  i.e.,  that  the  systematic  errors  associated 
with  this  are  negligible  compared  with  the  statistical  errors. 

5.  ACCOUNTING  FOR  LUMINOSITY-DEPENDENT  BIAS 

We  have  now  measured  the  real-space  power  spectrum 
Pgg(k)  of  the  SDSS  galaxies,  obtaining  the  results  shown  in 
Figure  22.  The  goal  of  this  section  is  to  compute  and  apply  a 
small  fr-40%)  scale-dependent  bias  correction,  producing  a 
curve  proportional  to  the  underlying  matter  power  spectrum 
and  usable  for  cosmological  parameter  estimation. 

As  discussed  in  §  3.1.3,  there  is  good  reason  to  believe  that 
bias  is  complicated  on  small  scales,  yet  simple  and  essentially 
scale-independent  on  the  extremely  large  scales  L=2n/k^ 
60  h~l  Mpc  that  are  the  focus  of  this  paper.30  However,  since 
this  scale-independent  bias  factor  depends  on  luminosity 
(among  other  galaxy  properties),  we  should  expect  to  intro¬ 
duce  an  artificial  scale  dependence  of  bias  from  the  magnitude- 
limited  nature  of  our  sample. 

It  is  easy  to  understand  how  luminosity-dependent  cluster¬ 
ing  can  masquerade  as  scale-dependent  bias.  Since  luminous 
galaxies  dominate  the  sample  at  large  distances  and  dim  ones 
dominate  nearby,  a  measurement  of  Pgg(k)  on  very  large 
scales  is  statistically  dominated  by  luminous  galaxies  whereas 
a  measurement  on  small  scales  is  dominated  by  dim  ones 
(which  have  much  higher  number  density).  Since  luminous 
galaxies  cluster  more  than  dim  ones,  the  measured  power 
spectrum  will  therefore  be  redder  than  the  matter  power 
spectrum,  with  a  lower  ratio  of  small-scale  to  large-scale 
power. 

Below  we  quantify  and  correct  for  this  effect.  We  emphasize 
that  this  is  not  intended  to  be  the  mother  of  all  bias  treatments 
and  the  final  word  on  the  subject.  Rather,  this  artificial  red-tilt 
is  a  small  (^10%)  effect  that  has  never  previously  been 
quantified,  and  we  simply  wish  to  make  a  first-order  estimate 
of  it.  We  start  by  measuring  the  luminosity  dependence  of  bias 
by  using  our  volume-limited  subsamples  in  the  next  section, 
then  use  this  to  compute  the  scale-dependent  effect. 

30  On  large  scales,  bias  can  also  introduce  an  additive  (as  opposed  to 
multiplicative)  constant,  related  to  halo  shot  noise,  thereby  affecting  the  shape 
of  the  power  spectrum  on  scales  larger  than  the  turnover  (Scherrer  &  Weinberg 
1998;  Seljak  2001;  Durrer  et  al.  2003).  Although  this  effect  is  negligible  for 
k  ^  0.003  h~l  Mpc  and  is  therefore  unimportant  for  the  present  paper,  it  may 
be  important  for  the  upcoming  analysis  of  the  SDSS  luminous  red  galaxy 
(LRG)  sample,  both  because  the  halo  shot  noise  effect  is  larger  for  such  rare 
objects  and  because  LRGs  probe  P(k)  out  to  larger  scales  than  does  the  main 
SDSS  galaxy  sample  analyzed  here. 


It  has  been  long  known  (Davis  &  Geller  1976;  Dressier 
1980)  that  galaxy  bias  depends  on  other  galaxy  properties  as 
well,  e.g.,  morphological  type,  color,  and  environment.  For¬ 
tunately,  the  only  intrinsic  property  that  determines  whether  a 
galaxy  gets  included  in  our  baseline  sample  is  its  luminosity, 
so  we  can  ignore  dependence  on  all  other  properties  for  our 
present  purposes  (type  dependence  of  clustering  is  of  course  a 
fascinating  subject  of  its  own  and  will  be  the  topic  of  future 
SDSS  papers). 

5.1.  Measurement  of  the  Luminosity’  Dependence  of  Bias 

To  quantify  the  luminosity  dependence  of  bias  for  the  SDSS 
galaxies,  we  repeat  our  entire  analysis  for  each  of  the  volume- 
limited  samples  L2-L7  specified  in  Table  1  and  plotted  in  §  2 
(samples  LI  and  L8  contain  too  few  galaxies  to  be  useful). 
The  resulting  power  spectra  are  shown  in  Figure  28.  To  avoid 
excessive  clutter  in  this  figure,  we  plot  the  minimum  variance 
power  spectrum  estimate  described  in  Appendix  B.3.1.  To 
indicate  that  the  measurement  errors  are  correlated  between 
A;-bins,  we  show  the  measurements  as  a  shaded  band  rather 
than  as  separate  points,  with  the  vertical  thickness  of  the  band 
corresponding  to  the  1  <j  uncertainty.  (The  bias  fitting  below 
uses  the  full  covariance  matrix  and  is  of  course  independent  of 
what  plotting  convention  is  used,  as  is  x2  computed  with 
eq.  [B8].) 

Figure  28  shows  that  all  power  spectra  have  roughly  the 
same  shape,  increasing  in  amplitude  as  the  galaxies  become 
more  luminous.  This  is  seen  more  clearly  in  Figure  29,  where 
we  have  divided  them  all  by  the  linear  power  spectrum  of  the 
simple  ACDM  reference  model  described  below. 

To  quantify  this  similarity  of  shapes,  we  fit  each  of  the 
measured  power  spectra  to  the  reference  ACDM  curve  with 


o.oi  o.i 

k  [h  Mpc-1] 


Fig.  28. — Real-space  power  spectrum  Pgg{k)  for  galaxies  in  six  bins  of 
absolute  magnitude  Mg  \r  detailed  in  Table  1,  with  the  shading  indicating  1  a 
uncertainty.  All  power  spectra  have  roughly  the  same  shape,  increasing  in 
amplitude  as  the  galaxies  become  more  luminous.  The  dashed  curve  is  the 
best-fit  linear  ACDM  model  (see  text)  normalized  to  u%  =  1.  [See  the  elec¬ 
tronic  edition  of  the  Journal  for  a  color  version  of  this  figure .] 


726 


TEGMARK  ET  AL. 


Vol.  606 


Fig.  29. — Bias  relative  to  the  linear  ACDM  model  of  Fig.  28  for  galaxies  in 
six  bins  of  absolute  magnitude  Mo.ir •  All  six  curves  are  consistent  with  being 
scale  independent,  the  bias  merely  increasing  in  amplitude  as  the  galaxies 
become  more  luminous.  [See  the  electronic  edition  of  the  Journal  for  a  color 
version  of  this  figure.] 


strongly  clustered  for  larger  luminosity  (smaller  absolute 
magnitude  M).  Let  us  consider  the  simple  bias  model 

6M(r)  =  b(M)6(r),  (26) 

where  6(r)  is  the  field  of  matter  fluctuations  and  b(M)  is  the 
luminosity-dependent  bias  factor  proportional  to  what  is 
plotted  in  Figure  30.  Since  our  observed  galaxy  sample  mixes 
galaxies  of  various  absolute  magnitudes  with  some  probability 
distribution /m{M\ r),  our  observed  density  field  can  be  written 

Sobs{r)=  J  fM(M;r)6M{r)dM.  (27) 

This  probability  distribution  /m(A7;  r)  is  simply  proportional 
to  the  galaxy  luminosity  function  <f>(M)  over  the  absolute 
magnitude  range  Mbri(r)  <  M  <  Mdim(r)  where  the  galaxy  is 
observable  at  comoving  distance  r,  zero  otherwise,  and  nor¬ 
malized  to  integrate  to  unity.  MbTfr)  and  Md;m(r)  are  given  by 
equation  (A5)  on  inserting  the  appropriate  absolute  mag¬ 
nitude  cuts  from  Table  1  (for  instance,  the  sample  safe  13  has 
Mnin  =  —23,  Mmax  =  —18.5).  Substituting  equation  (26)  into 
equation  (27),  we  obtain 


<W0  =  beS(r)6(r),  (28) 


where 


the  amplitude  freely  adjustable.  All  six  cases  produce  ac¬ 
ceptable  fits  with  reduced  x2  of  order  unity,  and  the 
corresponding  best-fit  normalizations  are  shown  in  Figure  30. 

We  want  our  reference  model  to  provide  an  accurate  em¬ 
pirical  characterization  of  the  SDSS  data  with  as  few  param¬ 
eters  as  possible.  We  choose  it  to  be  a  flat  scale-invariant 
ACDM  model  with  the  baryon  density  h 2  f =  0.024  favored 
by  WMAP  (Bennett  et  al.  2003)  and  the  Flubble  parameter 
h  =  0.72  favored  by  the  Hubble  Space  Telescope  Key  Project, 
leaving  0„,  as  the  only  free  parameter  determining  its  shape. 
We  determine  f by  the  following  iterative  procedure: 

1.  Given  an  ttm  value,  we  compute  the  reference  model 
P(k)  normalized  to  as  =  1. 

2.  Given  the  reference  model,  we  fit  for  the  six  bias  factors 
plotted  in  Figure  30. 

3.  We  fit  these  bias  factors  to  a  smooth  curve  b(M)/bt  = 
A  +  B(L/L *)  +  C(M  -  A/*)  shown  in  Figure  30  and  given  by 
the  three  parameters  (A,  B,  C). 

4.  We  compute  the  correction  beS(k)  for  scale-dependent 
bias  shown  in  Figure  22  {top)  as  described  below. 

5.  We  compute  the  value  of  Clm  that  best  fits  the  bias- 
corrected  measurements  in  Figure  22  (bottom). 

This  procedure  converges  to  within  floating-point  numerical 
precision  in  merely  a  few  iterations  for  starting  values  any¬ 
where  in  the  range  0.1  <  Clm  <  1.0,  yielding  Clm  =  0.300  and 
(A,B,C)  =  (0.895,0.150,-0.040).  The  basic  reason  for  this 
robustness  is  that  changing  the  shape  of  the  fiducial  model 
changes  bes(k)2  by  a  much  smaller  amount,  because  of  the 
smearing  by  the  integrals  below. 

5.2.  Correcting  for  the  Luminosity’  Dependence  of  Bias 

Above  we  quantified  the  well-known  fact  that  the  density 
field  6m  (r)  of  galaxies  of  absolute  magnitude  M  is  more 


freffM 


fM(M-r)b(M)  dM 


$(M) dM 


(29) 


Fig.  30. — Squares  show  the  measured  bias  of  our  SDSS  galaxies  as  a 
function  of  magnitude  relative  to  b *,  the  bias  at  M*  =  —20.83  {vertical  dotted 
line).  The  solid  curve  shows  the  fit  to  our  measurements  described  in  the  text, 
b{M)/K  =  0.895  +  0.150Z/Z,*  -  0.040(M  -  A/*),  and  the  dashed  curve 
shows  the  corresponding  fit  of  Norberg  et  al.  based  on  2dFGRS  data, 
b{M)/b *  =  0.85  +  0.15Z-/L*.  [See  the  electronic  edition  of  the  Journal  for  a 
color  version  of  this  figure.] 
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We  evaluate  this  expression  by  using  the  SDSS  luminosity 
function  measured  in  Blanton  et  al.  (2003c).  The  results  are 
plotted  in  Figure  31,  and  the  effective  bias  is  seen  to  increase 
with  distance  as  expected.  We  see  that  the  curve  bes(r)  becomes 
shallower  as  the  range  of  absolute  magnitudes  in  the  sample  is 
cut,  so  the  samples  safeO,  safe  13,  and  safe22  are  progressively 
less  affected.  Our  volume-limited  samples  by  construction 
simply  have  bes(r)  =  const. 

Bias  is  expected  to  depend  not  only  on  luminosity  but  also 
on  time  (Fry  1996;  Tegmark  &  Peebles  1998;  Giavalisco  et  al. 
1998;  Cen  &  Ostriker  2000;  Blanton  et  al.  2000).  In  addition, 
the  intrinsic  matter  clustering  should  increase  over  time.  Since 
the  net  result  of  these  two  counteracting  effects  is  likely  to  be 
smaller  than  the  luminosity  effect  at  the  low  redshifts  (z;S  0.2) 
that  we  are  considering,  this  is  difficult  to  measure  separately. 
The  same  applies  to  the  small  effect  from  the  time  dependence 
of  the  redshift-space  distortion  parameter  [3  caused  by  the  time 
dependence  of  Qm  and  fiA.  Indeed,  since  typical  luminosity 
grows  monotonically  with  distance,  the  distance-dependent 
bias  beff(r)  plotted  in  Figure  31  should  be  expected  to  ap¬ 
proximately  include  the  combination  of  all  four  effects,  so  that 
our  analysis  will  to  first  order  be  corrected  for  all  of  them. 

Let  us  now  estimate  how  bes(r)  translates  into  Ar-dependent 
bias  in  our  measured  power  spectrum.  We  do  this  in  the  FKP 
approximation  (Feldman  et  al.  1994).  Here  the  density  field 
bobs(r)  is  multiplied  by  a  weight  function 


and  then  Fourier  transformed,  giving 

j,{r)S0bsir)e-^^  =  J  (j>(r)bett{r)8{r)e~lk'r d3r  (31) 

because  of  equation  (28),  so  we  see  that  we  are  simply  mea¬ 
suring  the  power  spectrum  by  using  a  modified  effective 
weight  function,  replacing  (j)  by  (j)bes-  It  is  well  known  (see 
Tegmark  et  al.  1998  for  a  review  conforming  to  the  present 
notation)  that  the  FKP  estimate  P(k)  of  the  three-dimensional 
power  spectrum  P(k)  satisfies 

(P(k))  =  (W  *P)(k),  (32) 

i.e.,  that  it  probes  the  true  power  spectrum  convolved  with  a 
window  function  W(k).  This  window  function  is  the  square 
modulus  of  the  Fourier  transform  of  the  weight  function,  so  it 
is  given  by 

W0(k)  =  \4>(k)\2,  Wx{k)  =  \$beS(k)\2,  (33) 

where  Wb  applies  if  we  ignore  bias  and  W\  applies  if  we  take 
bias  into  account.  According  to  equation  (32),  galaxy  bias 
therefore  increases  the  measured  power  by  a  factor 

b m?  -  g CAW  _  JnvmtiPMK  ,34, 

(W„.P)(k)  JW„(k')P(k-k')d>  f  1 1 


4>(r)  cx 


n(r)P(k) 

1  +  h{r)P(k) 


(30) 


Fig.  31. — Effective  bias  (eq.  [29];  heavy  curves)  is  seen  to  increase  with 
distance,  reflecting  the  fact  that  more  distant  galaxies  are  on  average  more 
luminous.  The  curves  become  shallower  as  the  range  of  absolute  magnitudes 
in  the  sample  is  cut,  going  from  safeO  (no  cuts;  dashed  curve)  to  safe  13 
(—23  <  Mo.ir  <  —18.5;  solid  curve)  to  safe22  (—22  <  Mo.ir  <  — 18;  dotted 
curve).  Our  volume-limited  samples  simply  have  Z?eff  M  constant.  The  five  thin 
curves  show  the  relative  weights  given  to  different  distances  when  measuring 
P(k)  for  k  —  0.03,  0.1,  0.3,  1,  and  3  h  Mpc-1  by  using  the  safel3  radial 
selection  function.  [See  the  electronic  edition  of  the  Journal  for  a  color  ver¬ 
sion  of  this  figure.] 


The  window  function  is  normally  narrower  than  the  scale  on 
which  the  power  spectrum  varies  appreciably,  so  we  can  make 
the  approximation  of  taking  it  out  of  the  convolution  integral, 
obtaining  simply 


,  /,  x2  _  /  Wx  (k!)  d2k'  f  <j>{r)2beS{rf  d3r 

W)  f  Wo(k')  d^k'  f<p(r)2dh- 


(35) 


where  we  used  ParsevaFs  theorem 

J  \$(k)\2  d3k  =  (2n)3  J  \(j)(r)\2  d3r  (36) 


in  the  last  step.  In  summary,  substituting  equation  (30),  we  have 
shown  that  the  effective  bias  as  a  function  of  wavenumber  is 


bef[(k) 


'  f  (  b{r)P(k)  \ 

J  \  l  +h(r)P(k)J 

/  f  (  n{r)P{k) 

/  J  V  1  +  n(r)P(k) 


2 

beff(r)2r2  d  In  r 


r  d  In  r 


1/2 


(37) 


The  resulting  curves  beS(k)  are  plotted  in  Figure  32.  As 
expected,  the  effective  bias  is  seen  to  decrease  with  wave- 
number  k,  reflecting  the  fact  that  low-A:  measurements  are 
dominated  by  more  distant  and  luminous  galaxies.  Just  as  in  the 
previous  figure,  the  curves  become  shallower  going  from  safeO 
to  safel3  to  safe22,  as  the  range  of  absolute  magnitudes  shrinks. 

Note  that  if  one  treats  P(k)  as  a  constant  in  equation  (37), 
then  beg(k)  becomes  a  constant  independent  of  k.  Of  the 
magnitude-limited  galaxy  survey  analyses  of  the  last  decade, 
essentially  the  only  one  using  such  a  constant  weighting  was  the 
2dFGRS  analysis  by  Percival  et  al.  (2001).  Thus,  one  can 
minimize  the  luminosity  bias  at  the  expense  of  increased  sta¬ 
tistical  errors  due  to  suboptimal  galaxy  weighting  (as  shown  by 
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Fig.  32. — Effective  bias  in  the  power  spectrum  measurement,  eq.  (37),  is 
seen  to  decrease  with  wavenumber  k,  reflecting  the  fact  that  low-/:  measure¬ 
ments  are  dominated  by  more  distant  and  luminous  galaxies.  The  curves  be¬ 
come  shallower  as  the  range  of  absolute  magnitudes  in  the  sample  is  cut,  going 
from  safeO  (no  cuts;  thick  dashed  curve )  to  safe  13  (—23  <  Mo.ir  <  —18.5; 
solid  curve)  to  safe22  (—22  <  Mo.ir  <  — 18;  dotted  curve).  \See  the  electronic 
edition  of  the  Journal  for  a  color  version  of  this  figure .] 

Feldman  et  al.  1994,  such  uniform  weighting  is  desirable  only 
for  volume-limited  surveys  for  which  the  galaxy  number  den¬ 
sity  is  constant).  In  this  paper  we  have  instead  used  minimum- 
variance  methods  to  measure  the  luminosity  bias  and  power 
spectrum  jointly.  For  a  detailed  discussion  of  these  issues  that 
appeared  after  the  original  version  of  this  paper  was  submitted, 
see  Percival,  Verde,  &  Peacock  (2004). 

The  bottom  line  of  this  section  is  that  although  luminosity- 
dependent  bias  has  only  a  small  tilting  effect  on  our  measured 
SDSS  power  spectrum,  we  can  and  should  correct  for  it  when 
doing  precision  cosmology,  by  simply  dividing  the  measured 
power  spectrum  by  the  square  of  the  curve  in  Figure  22  (top). 
This  correction  curve  has  a  slope  around  —10%  per  decade  at 
k  ~  0.2  h  Mpc~'.  This  means  that  fitting  for  cosmological 
parameters  by  ignoring  this  effect  would  give  noticeably  bi¬ 
ased  results.  For  instance,  assuming  a  power-law  primordial 
power  spectrum  of  the  form  k"s,  this  would  correspond  to 
shifting  the  best-fit  spectral  index  ns  by  an  amount  A ns  s=s 
-(2/  In  10)  x  10%  «  —0.1,  and  a  more  careful  analysis  in 
§  7.3  shows  the  net  effect  to  be  —0.06.  To  place  this  in  context, 
the  popular  stochastic  eternal  inflation  model  (Linde  1994) 
predicts  ns  «  0.96,  i.e.,  a  substantially  smaller  departure  from 
the  ns  =  1  scale-invariant  case. 

6.  TESTS  FOR  SYSTEMATIC  ERRORS  IN  THE  DATA 

The  Monte  Carlo  experiments  described  in  §  4  provided  an 
end-to-end  validation  of  our  method  and  our  software.  In  this 
section,  we  turn  to  potential  systematic  errors  in  the  SDSS  data 
themselves.  Examples  of  such  effects  include  radial  modu¬ 
lations  (due  to  misestimates  of  evolution  or  W-corrections)  and 
angular  modulations  (due  to  effects  such  as  uncorrected  dust 
extinction,  variable  observing  conditions,  photometric  cali¬ 
bration  errors,  and  fiber  collisions)  of  the  density  field. 

6. 1 .  Analysis  of  Subsets  of  Galaxies 

Since  such  effects  would  be  expected  to  vary  across  the  sky 
(depending  on,  say,  reddening,  seasonally  variable  baseline 
offsets,  or  observing  conditions  such  as  seeing  and  sky 


brightness),  we  repeat  our  entire  analysis  for  four  different  an¬ 
gular  subsets  of  the  sky  (subsamples  A1-A4  from  Table  1)  in 
search  of  inconsistencies.  We  subtract  the  power  spectrum 
measured  south  of  the  Galactic  plane  (Al)  from  the  power 
spectrum  measured  north  of  the  Galactic  plane  (A2)  for  k  < 
0.2  h  Mpc~'  by  using  the  modeling  method  and  obtain  a  re¬ 
sidual  x2  ~  16  for  19  degrees  offreedom.  A  similar  comparison 
of  the  two  disjoint  northern  regions  (A3  and  A4)  gives  a  re¬ 
sidual  x2  ~  26,  again  for  19  degrees  offreedom.  Under  the  null 
hypothesis  that  a  such  a  pair  of  curves  are  independent  mea¬ 
surements  of  the  same  underlying  power  spectrum,  the  mean  and 
standard  deviation  are  (x2)  =  19  and  Ay2  =  (2  x  19)^2  «  6, 
so  these  residuals  are  —0.4  and  +1.4  standard  deviations  away 
from  the  expectation,  respectively.  In  other  words,  there  is  no 
significant  evidence  for  discrepancies  between  the  power 
spectra  measured  in  different  parts  of  the  sky. 

The  actual  residuals  are  shown  in  Figure  33.  Since  our 
measurements  in  different  £-bands  are  uncorrelated,  x2  is 
simply  the  sum  of  the  square  of  what  is  plotted.  The  most 
notable  discrepancy  is  at  k  ~  0.05  h  Mpc-1,  where  there  is 
more  power  in  A4  than  in  A3.  This  appears  to  be  related  to  a 
striking  wall-like  structure  that  is  seen  in  the  northern  galaxy 
distribution  around  z  ~  0.08  (see  Fig.  5).  Although  this  “great 
blob”  may  be  the  largest  coherent  structure  yet  observed,  a 
first  crude  estimate  suggests  that  it  is  not  inconsistent  with 
Gaussian  fluctuations:  visual  inspection  of  the  275  PTHalos 
simulations  reveals  similar  structures  in  more  than  10%  of  the 
cases.  A  similar  comparison  of  the  power  spectra  in  the  radial 
subsamples  R1-R3  is  less  useful,  since  this  radial  binning  is 
largely  degenerate  with  the  luminosity  binning  (§  5),  so  we 
test  for  radial  systematics  with  mode  subsets  instead. 

6.2.  Analysis  of  Subsets  of  Modes 

Because  of  their  angular  or  radial  nature,  all  potential  sys¬ 
tematic  errors  discussed  above  create  excess  power  mainly  in 


k  [h  Mpc-1] 

Fig.  33. — Comparison  of  power  in  different  parts  of  the  sky  defined  by  the 
angular  subsamples  A1-A4  in  Table  1.  Each  curve  shows  the  difference  of 
two  power  spectra  divided  by  the  error  bar  on  this  quantity,  so  x2  's  simply  the 
sum  of  the  square  of  what  is  plotted.  \-2  per  degree  of  freedom  is  0.86  for  A2- 
A1  (north  vs.  south)  and  1.39  for  A3-  A 4  (the  two  separate  northern  regions). 
[See  the  electronic  edition  of  the  Journal  for  a  color  version  of  this  figure .] 


No.  2,  2004 


3D  POWER  SPECTRUM  OF  GALAXIES  FROM  SDSS 


729 


the  radial  and  angular  modes.  To  quantify  any  such  excess,  we 
therefore  repeat  our  entire  analysis  with  all  233  special  modes 
(27  radial  modes,  199  angular  modes,  and  7  Local  Group 
modes)  deleted.  The  results  of  this  test  are  shown  in  Figure  34 
and  are  very  encouraging;  the  differences  are  tiny.  Any  sys¬ 
tematic  errors  adding  power  to  these  special  modes  would  cause 
the  squares  to  lie  systematically  above  the  crosses,  yet  the 
squares  fall  below  the  crosses  for  four  of  the  live  leftmost  bands, 
where  such  systematics  would  be  most  important.  Thus,  there  is 
no  indication  of  excess  radial  or  angular  power  in  the  data. 

Figure  34  shows  that  removing  the  special  modes  results  in 
a  slight  error  bar  increase  on  the  largest  scales  and  essentially 
no  change  on  smaller  scales.  This  can  be  readily  understood 
geometrically.  If  we  count  the  number  of  modes  that  probe 
mainly  scales  k  <  k*,  then  the  number  of  purely  radial,  purely 
angular,  and  arbitrary  modes  will  grow  as  kt,  k%,  and  kf  re¬ 
spectively.  This  means  that  “special”  modes  (radial  and  an¬ 
gular)  will  make  up  a  larger  fraction  of  the  total  pool  on  large 
scales  (at  small  k)  and  that  the  purely  radial  ones  will  be 
outnumbered  by  the  purely  angular  ones. 

Our  treatment  of  spectroscopic  fiber  collisions  described  in 
Appendix  A  is  another  source  of  potential  angular/radial 
problems.  By  assigning  both  members  of  some  close  pairs 
(separated  by  less  than  55",  corresponding  to  0.08  /?-1  Mpc  at 
the  mean  depth  of  the  survey)  at  the  same  redshift,  we  over¬ 
estimate  the  radial  power  on  small  scales.  Zehavi  et  al.  (2002) 
perform  extensive  tests  of  this  effect  and  show  that  it  is  neg¬ 
ligible  on  large  scales  considered  in  this  paper.  As  a  further 
cross-check,  we  repeat  our  entire  calculation  with  all  galaxies 
with  such  assigned  redshifts  removed.  Since  this  second  ap¬ 
proach  is  guaranteed  to  underestimate  the  power,  the  two 
approaches  will  bracket  the  correct  answer.  As  expected  on  the 


k  [h  Mpc-1] 

Fig.  34. — Effect  of  removing  special  modes.  The  black  curve  with  asso¬ 
ciated  error  bars  shows  our  measured  power  spectrum  Pgg{k)  from  the  mod¬ 
eling  method  using  all  4000  PKL  modes.  Red  squares  with  error  bars  show  the 
effect  of  removing  the  234  special  modes  corresponding  to  purely  angular  and 
purely  radial  fluctuations,  as  well  as  fluctuations  associated  with  the  motion  of 
the  Local  Group  relative  to  the  CMB  rest  frame.  Any  systematic  errors  adding 
power  to  these  special  modes  would  cause  the  curve  to  lie  systematically 
above  the  squares.  \See  the  electronic  edition  of  the  Journal  for  a  color 
version  of  this  figure .] 


basis  of  the  Zehavi  et  al.  (2002)  analysis,  we  find  no  evidence 
that  fiber  collisions  are  boosting  our  measured  power  spec¬ 
trum  on  the  smallest  scales  we  probe  (k  ~  0.3  h  Mpc-1). 

7.  DISCUSSION  AND  CONCLUSIONS 
7. 1 .  Basic  Results 

We  have  measured  the  shape  of  the  real-space  power 
spectrum  P(k)  on  large  scales  by  using  the  SDSS  galaxy 
redshift  catalog,  paying  particular  attention  to  quantifying  and 
correcting  for  the  effects  of  survey  geometry,  redshift-space 
distortions,  and  luminosity-dependent  bias.  Our  principal 
results  are  the  estimates  Pgg(k)  of  the  real-space  galaxy  power 
spectrum,  which  are  listed  in  Tables  2  and  3  for  the  disen¬ 
tanglement  and  modeling  methods,  respectively.  As  discussed 
in  §  4,  the  disentanglement  method  is  more  robust,  but  the 
modeling  method  (Fig.  22)  yields  smaller  statistical  errors  and 
appears  in  our  tests  to  introduce  little  systematic  error.  Table  3 
lists  results  both  for  FOG  compression  with  Sc  =  200,  which 
we  consider  the  most  reliable  choice  for  estimating  the  true 
real-space  power  spectrum,  and  for  no  FOG  compression, 
which  is  the  case  easiest  to  model  in  detail.  Our  estimation 
procedure  yields  uncorrelated  error  bars,  so  the  reported  errors 
in  these  tables  can  be  used  as  a  diagonal  covariance  matrix 
when  evaluating  the  likelihood  for  model  fits.  For  such  fits,  it 
is  crucial  to  use  the  exact  window  functions.31 

As  noted  in  §  4,  uncertainties  in  the  values  of  /3  and  r  leave 
a  4%  1  a  uncertainty  in  the  overall  normalization  of  Pgg(k) 
with  the  modeling  method,  in  addition  to  the  error  bars  on 
individual  points.  There  is  no  corresponding  normalization 
uncertainty  for  the  disentanglement  method.  Our  tabulated 
power  measurements  have  all  been  corrected  for  the  effect  of 
luminosity-dependent  bias  as  discussed  in  §  5.  The  correction 
b(k)  used  is  given  in  Tables  2  and  3  and  is  normalized  so  that 
our  quoted  power  measurements  represent  the  power  spectrum 
of  galaxies  with  absolute  r-band  magnitude  M*  «  —20.83;  the 
relative  bias  of  galaxies  as  a  function  of  luminosity  can  be 
found  in  Figure  30. 

7.2.  Using  Our  Results 

There  are  several  levels  at  which  one  might  use  our  results, 
depending  on  how  conservative  one  wishes  to  be  and  how 
much  energy  one  has  for  theoretical  modeling.  The  mock 
catalog  tests  in  §  4.2  (Fig.  27  in  particular)  suggest  that  our 
method  is  quite  successful  at  correcting  for  redshift-space 
distortions  to  recover  the  real-space  galaxy  power  spectrum 
Pt™e(k).  However,  there  are  notable  departures  from  perfect 
recovery  at  k  ^0.15  h  Mpc-1,  and  the  tests  are  in  any  event 
carried  out  for  a  particular  choice  of  cosmology  and  galaxy 
bias  model.  The  simplest  and  least  conservative  way  to  use 
our  results  is  to  assume  that  we  have  indeed  recovered  P^e(k) 
and  to  further  assume  that  on  the  scales  of  our  measurement 
the  galaxy  power  spectrum  is  a  scale-independent  multiple  of 
the  linear  theory  matter  power  spectrum,  Pgg(k )  =  b\P{k), 
where  b%  represents  the  large-scale  bias  factor  of  galaxies. 
The  agreement  of  the  lines  representing  Pgg(k)  and  the  linear 
P(k)  in  Figure  27  suggests  that  this  approach  is  probably  safe 
for  k  Si  0.1  h  Mpc-1  and  that  one  can  use  the  more  precise 
modeling  estimates  of  Pgg{k)  (Table  3)  without  incurring  a 
systematic  error  that  is  significant  relative  to  the  statistical 

31  The  window  functions  are  available  at  http://www.hep.upenn.edu/~max/ 
sdss.html,  together  with  sample  software  for  evaluating  the  SDSS  likelihood 
function. 
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Fig.  35. — Comparison  with  other  galaxy  power  spectrum  measurements. 
Numerous  caveats  must  be  borne  in  mind  when  this  figure  is  interpreted.  Our 
SDSS  power  spectrum  measurements  are  those  from  Fig.  22,  corrected  for  the 
red-tilting  effect  of  luminosity-dependent  bias.  The  purely  angular  analyses  of 
the  APM  survey  (Efstathiou  &  Moody  2001)  and  the  SDSS  (the  points  are 
from  Tegmark  et  al.  2002  for  galaxies  in  the  magnitude  range  21  <  r*  <  22 — 
see  also  Dodelson  et  al.  2002)  should  also  be  free  of  this  effect,  but  represent 
different  mixtures  of  luminosities.  The  2dFGRS  points  are  from  the  analysis  of 
HTX02  and,  like  the  PSCz  points  (HTP00)  and  the  UZC  points  (THX02), 
have  not  been  corrected  for  this  effect,  whereas  the  Percival  et  al.  2dFGRS 
analysis  should  be  unafflicted  by  such  red-tilting.  The  influential  PD94  points 
(Table  1  from  Peacock  &  Dodds  1994),  summarizing  the  state  of  the  art  a 
decade  ago,  are  shown  assuming  IRAS  bias  of  unity  and  the  then  fashionable 
density  parameter  =  1.  [See  the  electronic  edition  of  the  Journal  for  a 
color  version  of  this  figure.] 

errors  of  the  current  measurement.  However,  our  tests  are  not 
exhaustive,  and  it  is  possible  that  the  agreement  of  Pgg(k)  and 
linear  P(k)  shapes  in  Figure  27  arises  in  part  from  a  cancel¬ 
lation  of  nonlinear  gravitational  effects  with  errors  in  the 
redshift-space  distortion  correction.  This  cancellation,  in  turn, 
might  not  hold  for  other  cosmological  or  galaxy  bias  models. 

A  second  option  is  to  assume  that  we  have  recovered  P^e(k) 
but  not  assume  that  this  has  the  same  shape  as  the  linear 
theory  matter  power  spectrum.  Here,  for  example,  one  could 
use  H-body  simulations  or  analytic  approximations  to  compute 
the  nonlinear,  real-space  power  spectrum,  incorporating  gal¬ 
axy  bias  based  on  semianalytic  galaxy  formation  calculations, 
hydrodynamic  simulations,  or  a  “halo  occupation”  model 
constrained  by  other  measurements  of  galaxy  clustering. 
Figure  27  again  suggests  that  this  approach  can  be  used  safely 
for  k  Si  0.1  h  Mpc~'  (and  perhaps  a  bit  further)  without  sys¬ 
tematic  errors  that  exceed  the  statistical  errors.  One  can  also 
use  the  nonlinear  matter  power  spectrum  and  a  constant  b,  but 
there  is  good  reason  to  expect  scale-dependent  bias  on  scales 
where  nonlinearity  is  significant  (Hamilton  &  Tegmark  2002). 
Finally,  the  most  cumbersome  but  most  reliable  way  to  use  our 
data  is  to  follow  the  course  suggested  at  the  end  of  §  4:  create 
redshift-space  realizations  using  nonlinear  simulations  with 
galaxy  bias,  compute  the  monopole,  quadrupole,  and  hexa- 
decapole  components  of  the  redshift-space  power  spectrum  in 
the  distant  observer  approximation,  and  use  equation  (17)  to 
convert  them  to  Pgg{k).  These  predictions  should  be  compared 


directly  with  the  disentanglement  estimates  of  Pgg(k),  since  the 
redshift-space  distortions  have  been  incorporated  into  the 
model  rather  than  removed  from  the  data.  However,  by  fo¬ 
cusing  on  a  quantity  Pgg(k )  that  responds  mainly  to  real-space 
clustering  (exactly  so  in  the  linear  regime),  such  a  comparison 
will  be  insensitive  to  moderate  errors  in  the  redshift-space 
distortions  incorporated  in  the  theoretical  predictions.  This  last 
approach  is  still  much  simpler  than  creating  artificial  SDSS 
catalogs  and  reproducing  our  estimation  method  in  its  entirety, 
but  it  should  be  equally  valid. 

7.3.  Comparison  with  Other  Results 

Figure  35  compares  our  results  from  Table  3  (modeling 
approach)  with  other  measurements  from  galaxy  surveys  but 
must  be  interpreted  with  care.  The  UZC  points  may  contain 
excess  large-scale  power  because  of  selection  function  effects 
(Padmanabhan,  Tegmark,  &  Hamilton  2001;  THX02),  and  the 
angular  SDSS  points  measured  from  the  early  data  release 
sample  are  difficult  to  interpret  because  of  their  extremely 
broad  window  functions.  Only  the  SDSS,  APM,  and  angular 
SDSS  points  can  be  interpreted  as  measuring  the  large-scale 
matter  power  spectrum  with  constant  bias,  since  the  others 
have  not  been  corrected  for  the  red-tilting  effect  of  luminosity- 
dependent  bias.  The  Percival  et  al.  (2001)  2dFGRS  analysis 
unfortunately  cannot  be  directly  plotted  in  the  figure  because 
of  its  complicated  window  functions. 

Figure  36  is  the  same  as  Figure  35  but  restricted  to  a  com¬ 
parison  of  decorrelated  power  spectra,  those  for  SDSS, 
2dFGRS,  and  PSCz.  Because  the  power  spectra  are  decorre¬ 
lated,  it  is  fair  to  do  by  eye”  when  examining  this  figure.  The 
similarity  in  the  bumps  and  wiggles  between  the  three  power 
spectra  is  quite  striking.  Moreover,  there  is  an  interesting  degree 
of  similarity  with  the  power  spectrum  of  the  Abell/ACO  cluster 
catalog  (not  shown)  reported  by  Miller  &  Batuski  (2001).  It  is 
tempting  to  see  hints  of  baryonic  oscillations  in  these  wiggles. 


k  [h  Mpc-1] 

Fig.  36. — Same  as  Fig.  35,  but  restricted  to  a  comparison  of  decorrelated 
power  spectra,  those  for  SDSS,  2dFGRS,  and  PSCz.  The  similarity  in  the 
bumps  and  wiggles  between  the  three  power  spectra  is  intriguing.  [See  the 
electronic  edition  of  the  Journal  for  a  color  version  of  this  figure .] 


No.  2,  2004 


3D  POWER  SPECTRUM  OF  GALAXIES  FROM  SDSS 


731 


0.6 


0  0.2  0.4  0.6 

hQm 


Fig.  37. — Characterization  of  the  SDSS  power  spectrum  in  terms  of  con¬ 
straints  on  the  “shape  parameter”  h  £lm  and  the  baryon  fraction  fa.  Black 
points  show  100,000  Markov  chain  steps  for  SDSS;  gray  points  are  for  WMAP 
data.  [See  the  electronic  edition  of  the  Journal  for  a  color  version  of  this 
figure.} 

Indeed,  Percival  et  al.  (2001),  in  their  analysis  of  the  2dFGRS, 
and  Miller  et  al.  (2001a,  2001b;  see  also  Miller  et  al.  2002),  in 
their  analysis  of  the  Abell/ACO  cluster  (Miller  &  Batuski 
2001),  APM  cluster  (Tadros,  Efstathiou,  &  Dalton  1998),  and 
PSCz  surveys  (HTP00),  already  concluded  that  their  data 
mildly  favored  model  power  spectra  with  baryonic  oscillations 
over  those  without.  However,  the  oscillations  at  large  scales 
evident  in  Figure  36,  notably  the  dip  at  k  ~  0.035  h  Mpc~*  and 
bump  at  k  ~  0.05  h  Mpc  ,  are  substantially  larger  than  pre¬ 
dicted  by  the  standard  ACDM  concordance  model  illustrated  in 
Figure  22;  if  confirmed,  such  a  feature  would  challenge  the 
ACDM  concordance  model  with  scale-invariant  initial  con¬ 
ditions.  This  preference  for  a  large  baryon  fraction  is  also  seen 
in  Figure  37  (details  below),  which,  however,  shows  that  our 
SDSS  data  are  nonetheless  perfectly  consistent  with  the  con¬ 
cordance  baryon  fraction — about  one-sixth  of  the  100,000 
points  shown  have  a  baryon  fraction  below  the  WMAP  value 
of  17%. 

Figure  37  also  illustrates  why  galaxy  clustering  data  is  so 
complementary  to  CMB  measurements.  The  100,000  gray 
points  are  from  a  Monte  Carlo  Markov  chain  analysis  of  the 
WMAP  for  simple  flat  scalar  adiabatic  models  parameterized  by 
the  densities  of  dark  energy,  dark  matter,  and  baryronic  matter, 
the  spectral  index  and  amplitude,  and  the  reionization  optical 
depth.  As  emphasized  by  Eisenstein  et  al.  (1999)  and  Bridle 
et  al.  (2003),  WMAP  alone  cannot  determine  f2„,  to  better  than  a 
factor  of  2  or  so  because  of  a  strong  degeneracy  with  other 
parameters.  Fortunately,  the  WMAP  degeneracy  banana  in 
Figure  37  is  seen  to  be  almost  orthogonal  to  the  SDSS  degen¬ 
eracy,  which  means  that  combining  the  two  measurements 
dramatically  tightens  the  constraints  on  all  the  parameters  in¬ 
volved  in  the  degeneracy — notably  f im,  h,  and  er8. 

To  place  our  SDSS  results  in  a  larger  context,  Figure  38 
compares  them  with  other  measurements  of  the  matter  power 


spectrum P{k).  Here  the  CMB,  galaxy  cluster,  lensing,  and  Lyo 
forest  results  have  been  mapped  into  A'-space  using  the  method 
of  Tegmark  &  Zaldarriaga  (2002),  assuming  the  WMAP  model 
given  in  the  legend,  and  we  have  assumed  an  SDSS  bias  Z>*  =  1. 
The  CMB  data  combine  the  BOOMERANG,  MAXIMA, 
Degree  Angular  Scale  Interferometer  (DASI),  Cosmic  Back¬ 
ground  Imager,  Very  Small  Array,  Arcminute  Cosmology  Bo¬ 
lometer  Array  Receiver  (ACBAR),  and  WMAP  data  as  in  Wang 
et  al.  (2003)  with  the  WMAP  measurements  (Hinshaw  et  al. 
2003).  The  cluster  point  reflects  the  spread  in  the  recent  liter¬ 
ature  rather  than  any  one  quoted  measurement.  The  lensing  data 
are  from  Hoekstra,  Yee,  &  Gladders  (2002).  The  Lyo  forest 
points  are  from  the  Gnedin  &  Hamilton  (2002)  reanalysis  of  the 
Croft  et  al.  (1999)  data. 

We  leave  detailed  investigation  of  the  implications  of  our 
measurement  to  other  papers  (by  us  in  Paper  II  and,  we  hope, 
by  others),  since  the  primary  goal  of  this  paper  is  the  mea¬ 
surement  itself.  As  a  characterization  of  our  data,  we  briefly 
indulge  in  the  simplest  of  the  interpretive  approaches  de¬ 
scribed  in  §  7.2.  For  this  purpose,  we  fit  our  22  Pgg(k ) 
measurements  derived  from  the  modeling  method  with  8C  = 
200  by  using  the  linear  CDM  power  spectrum  of  Eisenstein  & 
Hu  (1999),  fixing  the  baryon  fraction  =  0.17  and  the 

Hubble  parameter  h  =  0.72  as  per  the  best  fits  from  WMAP 
(Bennet  et  al.  2003;  Spergel  et  al.  2003;  Verde  et  al.  2003)  and 
no  massive  neutrino  contribution.  If  we  further  fix  the  infla¬ 
tionary  spectral  index  to  ns  =  1,  then  the  shape  of  P(k) 
is  determined  by  the  combination  hQm,  and  we  find  hflm  = 
0.201  ±0.017  at  1  cr,  i.e.,  Om  =  0.300±0.018. 

As  discussed  above,  our  modeling  of  nonlinear  redshift- 
space  distortions  is  accurate  only  on  large  scales,  so  we 
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Fig.  38. — Comparison  of  our  results  with  other  P(k)  constraints.  The  lo¬ 
cation  of  CMB,  cluster,  lensing,  and  Lya  forest  points  in  this  plane  depends 
on  the  cosmic  matter  budget  (and,  for  the  CMB,  on  the  reionization  optical 
depth  r),  so  requiring  consistency  with  SDSS  constrains  these  cosmological 
parameters  without  assumptions  about  the  primordial  power  spectrum. 
This  figure  is  for  the  case  of  a  vanilla  flat  scalar  scale-invariant  model  with 
=  0.28,  h  —  0.72,  and  =  0.16,  r  =  0.17  (Spergel  et  al.  2003; 

Verde  et  al.  2003;  Tegmark  et  al.  2003),  assuming  6*  =  0.92  for  the  SDSS 
galaxies.  [See  the  electronic  edition  of  the  Journal  for  a  color  version  of  this 
figure.] 
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recommend  not  using  the  measurements  with  k  >  0.2  h 
Mpc-1  for  cosmological  analysis.  In  this  spirit,  we  fit  the 
19  Pgg(k)  measurements  for  k  <  0.2  h  Mpc~'  to  the  two- 
parameter  model  defined  by  the  Smith  et  al.  (2003)  nonlinear 
power  spectrum  approximation  by  using  the  Eisenstein  &  Hu 
(1999)  fit  as  above  for  the  linear  transfer  function,  fixing  the 
bias  b *  =  1.  This  gives  the  shape  parameter  Mlm  =  0.213  ± 
0.0233  at  1  tr,  i.e.,  f \m  =  0.295  ±  0.0323.  This  fit  has  \2  = 
15.6  for  19  —  2  =  17  effective  degrees  of  freedom,  so  this  two- 
parameter  model  fit  can  be  regarded  as  an  adequate  represen¬ 
tation  of  our  results  in  compact  summary  form.  (Using  a  linear 
power  spectrum  model  would  increase  this  f lm  value  by  0.043.) 
Figure  37,  which  was  commented  on  above,  shows  the  result  of 
repeating  this  same  fit  after  adding  the  baryon  fraction  as  a 
third  free  parameter.  Fixing  the  best-fit  shape  hit,,,  =  0.213,  the 
power  spectrum  amplitude  corresponds  to  erg  =  0.89  ±  0.02 
for  L*  galaxies  after  marginalizing  over  the  redshift-space 
distortion  parameters  (3  and  r.  Note  that  this  normalization  is  at 
the  effective  redshift  of  the  survey,  not  for  z  =  0  galaxies. 

If  we  fix  Qm  at  the  best-fit  value  0.291  and  treat  the  spectral 
index  as  the  free  shape  parameter,  then  we  find  ns  =  0.995  ± 
0.049.  Without  our  correction  for  luminosity-dependent  bias, 
the  corresponding  numbers  are  ns  =  0.933  ±  0.046,  so  the 
statistical  errors  are  now  small  enough  for  effects  such  as 
this  to  be  important.  Similarly,  Table  7  of  Paper  II  shows 
that  ignoring  this  correction  reduces  the  measured  value  of 
Om  by  0.035. 

Paper  II  presents  a  thorough  analysis  of  the  cosmological 
constraints  from  our  P(k)  measurement,  finding  them  in  good 
agreement  with  a  “vanilla”  flat  adiabatic  ACDM  model  with 
negligible  tilt,  running  tilt,  tensor  modes,  or  massive  neutri¬ 
nos.  Our  P(k )  measurement  provides  a  powerful  confirmation 
of  the  results  reported  by  the  WMAP  team  and  more  than 
halves  the  WMAP- only  error  bars  on  some  parameters,  e.g., 
the  matter  density  1  and  the  Hubble  parameter  h.  Paper  II 
finds  0,„  =  0.30  ±  0.04  from  RMHP+SDSS  when  marginal¬ 
izing  over  the  other  vanilla  parameters.  This  is  about  1  a 
higher  than  when  using  the  2dFGRS  survey  [which  gave  a 
slightly  redder  P(k)  slope  than  we  found] — just  the  sort  of 
statistical  difference  one  would  expect  from  two  completely 
independent  samples. 

7.4.  Outlook 

Let  us  conclude  by  looking  ahead.  Galaxy  surveys  have  the 
potential  to  greatly  improve  the  cosmological  constraints  from 
the  cosmic  microwave  background  by  breaking  degeneracies 
and  providing  cross-checks,  so  detailed  joint  analysis  of  our 
measurements  with  WMAP  and  other  data  sets  will  be 
worthwhile.  In  particular,  detecting  the  effect  of  baryons  on 
P(k)  (Tegmark  1997a;  Goldberg  &  Strauss  1998)  can  provide 
powerful  constraints  on  the  Hubble  parameter  (Eisenstein,  Hu, 
&  Tegmark  1998),  and  accurate  determination  of  the  shape  of 
P(k)  can  place  strong  constraints  on  neutrino  masses  (Hu, 
Eisenstein,  &  Tegmark  1998;  Spergel  et  al.  2003;  Hannestad 
2003)  and  help  pin  down  the  primordial  power  spectrum. 
Deeper  surveys  can  also  provide  interesting  constraints  on  the 
evolution  of  clustering  and  dark  energy,  and  the  SDSS  lumi¬ 
nous  red  galaxy  (LRG)  sample  and  photometric  redshift  cat¬ 
alog  will  complement  specialized  deep  redshift  surveys  such 
as  DEEP  (Davis  et  al.  2001)  and  V1RMOS  (Le  Fevre  et  al. 
2001)  in  this  regard. 


Prospects  are  also  good  for  reducing  systematic  uncertain¬ 
ties  involving  both  bias  and  redshift  distortions.  A  key  virtue 
of  having  very  large  galaxy  samples  is  that  it  permits  accurate 
measurements  for  large  numbers  of  subsamples.  For  instance, 
repeating  our  analysis  for  subsamples  based  on  galaxy  color 
or  spectral  type  will  provide  a  powerful  test  of  how  scale 
independent  the  bias  is  on  large  scales.  Moreover,  empirical 
constraints  from  SDSS  on  redshift-space  distortions  should 
improve  substantially.  These  constraints  are  currently  rather 
weak  because  the  survey  geometry  consists  largely  of  thin 
wedges;  we  have  therefore  focused  simply  on  modeling  dis¬ 
tortions  well  enough  to  remove  their  impact  on  the  real-space 
P(k)  estimate.  As  the  survey  area  fills  in  and  becomes  more 
contiguous,  we  expect  to  obtain  interesting  constraints  on 
redshift  distortions  that  can  be  used  to  test  and  refine  theo¬ 
retical  and  numerical  models. 

In  addition  to  more  careful  modeling  and  combining  with 
other  observational  constraints,  we  anticipate  several  com¬ 
plementary  results  from  the  SDSS  in  the  near  future,  such  as 
cosmological  constraints  directly  from  KL  modes  (Pope  et  al. 
2004),  real-space  clustering  on  small  scales  from  the  projected 
correlation  function  w(rp)  (I.  Zehavi  et  al.  2004,  in  prepara¬ 
tion),  power  spectrum  measurements  on  large  scales  by  using 
the  luminous  red  galaxy  sample,  and  angular  clustering 
measurements  by  using  photometric  redshifts  (Budavari  et  al. 
2003).  This  should  help  break  degeneracies  and  provide  cross¬ 
checks  to  test  rather  than  assume  the  physics  underlying  the 
cosmological  model,  thereby  strengthening  the  weakest  link  in 
post  -WMAP  cosmology. 
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APPENDIX  A 

DATA  AND  DATA  MODELING 

In  this  Appendix,  we  provide  a  detailed  description  of  how  our  basic  galaxy  sample  was  processed,  modeled,  and  split  into  the 
subsamples  used  in  our  power  spectrum  analysis. 

Al.  THE  SDSS  GALAXY  CATALOG 

The  SDSS  (York  et  al.  2000)  is  producing  imaging  and  spectroscopic  surveys  over  about  a  quarter  of  the  celestial  sphere.  A 
dedicated  2.5  m  telescope  at  Apache  Point  Observatory,  Sunspot,  New  Mexico,  images  the  sky  in  five  bands  between  3000  and 
10,000  A  (u,  g,  r,  i,  and  z;  Fukugita  et  al.  1996)  by  using  a  drift-scanning,  mosaic  CCD  camera  (Gunn  et  al.  1998),  detecting  objects 
to  a  flux  limit  of  r  ~  22.5.  The  photometric  quality  of  the  observations  is  tracked  by  using  an  automatic  photometricity  monitor 
(Hogg  et  al.  2001).  One  of  the  goals  is  to  spectroscopically  observe  900,000  galaxies  (down  to  Him  ~  17.77;  Strauss  et  al.  2002), 
100,000  LRGs  (Eisenstein  et  al.  2001),  and  100,000  QSOs  (Richards  et  al.  2002).  This  spectroscopic  follow-up  uses  two  digital 
spectrographs  (A.  Uomoto  et  al.  2004,  in  preparation)  on  the  same  telescope  as  the  imaging  camera.  Other  aspects  of  the  survey  are 
described  in  the  Early  Data  Release  (EDR)  paper  (Stoughton  et  al.  2002). 

The  SDSS  images  are  reduced,  and  catalogs  are  produced  by  the  SDSS  pipeline  photo  (Lupton  et  al.  2001),  which  detects  and 
measures  objects,  the  sky  background,  and  the  seeing  conditions.  As  described  in  Smith  et  al.  (2002),  magnitudes  are  calibrated  to 
a  standard  star  network  approximately  in  the  AB  system.  The  astrometric  calibration  is  also  performed  by  an  automatic  pipeline, 
which  obtains  absolute  positions  to  better  than  0"1  rms  per  coordinate  (Pier  et  al.  2003). 

Object  fluxes  are  determined  in  several  different  ways  by  photo,  as  described  in  Stoughton  et  al.  (2002).  The  primary  measure  of 
flux  used  for  galaxies  is  the  SDSS  Petrosian  magnitude,  a  modified  version  of  the  quantity  proposed  by  Petrosian  (1976).  In  the 
absence  of  seeing,  Petrosian  magnitudes  measure  a  constant  fraction  of  a  galaxy’s  light  regardless  of  distance  (or  size).  They  are 
described  in  greater  detail  by  Blanton  et  al.  (2001)  and  Strauss  et  al.  (2002).  Another  important  measure  of  flux  for  galaxies  is  the 
SDSS  model  magnitude,  which  is  an  estimate  of  the  magnitude  by  using  the  better  of  a  de  Vaucouleurs  and  an  exponential  fit  to  the 
image.  Finally,  the  SDSS  also  measures  the  flux  in  each  object  by  using  the  local  point-spread  function  (PSF)  as  a  template,  which 
is  the  highest  signal-to-noise  ratio  measurement  of  flux  for  point  sources. 

Main  sample  target  selection  (Strauss  et  al.  2002)  involves  star/galaxy  separation,  application  of  the  flux  limit,  application  of  the 
surface  brightness  limit,  and  application  of  the  fiber  magnitude  limit.  Expressed  quantitatively,  the  first  three  of  these  criteria  are 


'  PSF  '"mod  ^  .S' I  j  i n  ■ 

F|YI  <  Him; 

/Bo  <  /Bo.lirn,  (Al) 

where  rPet  is  the  dereddened  Petrosian  magnitude  in  the  r  band  (by  using  the  dust  maps  of  Schlegel,  Finkbeiner,  &  Davis  1998), 
'mod  is  the  model  magnitude,  rpsF  is  the  PSF  magnitude,  and  /. 1 50  is  the  Petrosian  half-light  surface  brightness  of  the  object  in  the 
r-band.  In  practice,  the  values  of  the  target  selection  parameters  vary  across  the  survey  in  a  well-understood  way,  but  for  the  bulk  of 
the  area,  they  are  Jnm  =  0.3,  r\im  =  17.17,  and  /u50  Um  =  24.5.  We  note  here  that  objects  near  the  spectroscopic  flux  limit  are  nearly 
5  mag  brighter  than  the  photometric  limit;  that  is,  the  fluxes  are  measured  at  a  signal-to-noise  ratio  of  a  few  hundred. 

Fibers  are  assigned  to  a  set  of  circular  tiles  with  a  field  of  view  1?49  in  radius  by  an  automatic  tiling  pipeline  (Blanton  et  al. 
2003b).  The  targets  are  observed  using  a  640  fiber  spectrograph  on  the  same  telescope  as  the  imaging  camera.  We  extract  one¬ 
dimensional  spectra  from  the  two-dimensional  spectrograms  by  using  a  pipeline  (idlspec2d)  created  specifically  for  the  SDSS 
instrumentation  (D.  Schlegel  et  al.  2004,  in  preparation);  a  further  pipeline  (specBS  v4_9)  fits  for  the  redshift  of  each  spectrum. 
The  official  SDSS  redshifts  are  obtained  from  a  different  pipeline  (J.  Frieman  et  al.  2004,  in  preparation).  The  two  independent 
versions  provide  a  consistency  check  on  the  redshift  determination.  They  are  consistent  (for  galaxies)  at  over  the  99%  level. 

Fibers  on  a  single  plate  cannot  be  placed  more  closely  than  55".  Thus,  redshifts  for  both  members  of  a  close  galaxy  pair  can  be 
obtained  only  in  regions  where  tiles  overlap.  If  we  did  not  take  fiber  collisions  into  account  at  all,  we  would  systematically 
underestimate  correlations  on  all  scales.  We  correct  for  this  bias  by  assigning  each  galaxy  pair  member  whose  redshift  was  not 
obtained  because  of  a  fiber  collision  the  same  redshift  as  the  pair  member  whose  redshift  was  measured.  Thus,  for  192,642  of  the 
galaxies  in  the  full  sample,  a  spectroscopic  redshift  is  available,  but  for  12,801  (~6%)  we  must  assign  redshifts  according  to  this 
prescription.  By  using  the  overlaps  of  multiple  tiles,  where  many  of  these  pairs  can  be  recovered,  Zehavi  et  al.  (2002)  have  shown 
that  this  procedure  works  well  on  large  scales,  and  we  confirm  this  conclusion  with  additional  tests  in  §  6. 

As  of  July  2002,  the  SDSS  had  imaged  and  targeted  2873  deg2  of  sky  and  taken  431,291  successful  spectra  (including  323,126 
spectra  of  galaxies)  over  2499  deg2  of  that  area.  We  created  a  well-defined  sample  for  calculating  large-scale  structure  and  galaxy 
property  statistics  from  these  data,  known  as  Large-Scale  Structure  sample  1 1 .  Our  sample  1 1  consists  of  all  of  the  photometry  for  all 
of  the  targets  over  that  area  (as  extracted  from  the  internal  SDSS  operational  database),  all  of  the  spectroscopic  results  (as  output 
from  idlspec2d),  and  a  description  of  the  angular  window  function  of  the  survey  and  the  flux  and  surface  brightness  limits  used  for 
galaxies  in  each  area  of  the  sky  (discussed  more  fully  below).  For  most  of  the  area,  we  used  the  same  version  of  the  analysis 
software  that  was  used  to  create  the  target  list.  However,  for  the  area  covered  by  the  EDR  (Stoughton  et  al.  2002)  we  used  the 
version  of  the  analysis  software  used  for  that  data  release,  since  it  was  substantially  better  than  the  early  versions  of  the  software 
used  to  target  that  area.  For  PHOTO,  the  most  important  piece  of  analysis  software  run  on  the  data,  the  versions  used  for  the 
photometry  range  from  v5_0  to  v5_2.  The  region  covered  by  this  sample  is  similar  to,  but  somewhat  greater  than,  the  region  that 
will  be  released  in  the  SDSS  Data  Release  1  (DR1;  which  will  use  a  newer  version  of  the  software,  which  among  other  things 
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improves  the  handling  of  large  galaxies).  For  all  of  the  subsamples  of  samplell  defined  here,  we  define  a  bright  limit  of  r  =  14.5 
(using  Petrosian,  dereddened  magnitudes)  to  exclude  the  spectroscopic  bright  limits  imposed  in  galaxy  target  selection,  as  well  as 
the  possibility  of  poor  deblending  of  bright  objects  by  PFIOTO  (a  problem  for  v5_2  and  previous,  though  less  so  for  v5_3). 

We  measure  galaxy  magnitudes  through  a  set  of  bandpasses  that  are  constant  in  the  observer  frame.  These  observer  frame 
magnitudes  correspond  to  different  rest-frame  magnitudes  depending  on  the  galaxy  redshift.  To  compare  galaxies  observed  at 
different  redshifts,  we  convert  all  the  magnitudes  to  a  single  fixed  set  of  bandpasses  by  using  the  method  of  Blanton  et  al.  (2003a; 
kcorrect  vl_l  1 ).  These  routines  fit  a  linear  combination  of  four  spectral  templates  to  each  set  of  five  magnitudes.  The  coefficient  ao 
of  the  first  template  is  an  estimate  of  the  flux  in  the  rest-frame  optical  range  (3500A  <  X  <  7500A),  while  the  fractional 
contribution  of  the  other  coefficients  fli/flo,  «2/«o,  and  a^/ao  characterizes  the  spectral  energy  distribution  (SED)  of  the  galaxy. 
The  most  significant  variation  is  along  ai/ao-  Taking  the  sum  of  the  templates  and  projecting  it  onto  filter  responses,  we  can 
calculate  the  A'-corrections  from  the  observed  bandpass  to  any  rest-frame  bandpass.  In  practice,  for  the  rare  galaxies  in  our  sample 
around  redshift  z  ~  0.3  this  procedure  is  unstable.  Thus,  for  this  sample  we  fix  the  coefficients  for  all  galaxies  at  redshifts  z  >  0.28 
to  the  average  value  for  galaxies  between  redshifts  0.26  <  z  <  0.28,  corresponding  to  the  typical  red  luminous  galaxy  SED.  The 
median  redshift  of  the  sample  is  0.1,  so  in  this  paper  we  quote  absolute  magnitude  rest- frame  bandpasses  shifted  blueward  from 
the  observatory  frame  bandpasses  by  a  factor  1.1  and  denoted  01r  in  /'-band.  This  procedure  minimizes  the  uncertainties  in  the 
A'-corrections,  since  galaxies  near  the  median  redshift  are  independent  of  their  SED.  For  a  galaxy  exactly  at  z=0.1,  the 
A'-corrections  are  trivial. 

In  the  remainder  of  this  section,  we  model  the  three-dimensional  selection  function  h(r),  which  gives  the  expected  number 
density  of  galaxies  in  the  absence  of  clustering  as  a  function  of  three-dimensional  position.  For  a  uniform  magnitude  limit,  our 
selection  function  is  separable  into  the  product  of  an  angular  part  and  a  radial  part: 

h(r)  =  h(r)n(r),  (A2) 

where  r  =  rr  and  r  is  a  unit  vector.  The  angular  part  h(r)  may  take  any  value  between  0  and  1  and  gives  the  completeness  as  a 
function  of  position,  i.e.,  the  fraction  of  all  survey-selected  galaxies  for  which  survey-quality  redshifts  are  actually  obtained,  while 
h(r)  gives  the  radial  selection  function.  As  described  in  §  6,  this  separability  allows  powerful  tests  for  possible  systematic  effects 
arising  from  extinction  or  calibration  problems,  which  would  cause  a  purely  angular  modulation  of  density  fluctuations,  or  from  a 
misestimate  of  the  radial  selection  function,  which  would  cause  a  purely  radial  modulation  of  the  density.  The  SDSS  spectroscopic 
completeness  is  so  good  that  we  find  no  evidence  for  weather-related  effects  breaking  the  separability  as  in  the  2dFGRS  (Colless 
et  al.  2001)  and  therefore  do  not  need  to  perform  corrections  for  this  effect  as  in  THX02.  We  now  describe  our  modeling  of  the  two 
factors  h(r)  and  n(r)  in  turn. 

A2.  THE  ANGULAR  SELECTION  FUNCTION 
A2.1.  Specification 

The  geometry  of  the  survey  is  somewhat  complex  because  of  the  fact  that  the  imaging  and  spectroscopy  programs  are  carried  out 
concurrently.  To  supply  targets  for  the  spectroscopic  program,  periodically  a  “target  chunk”  of  imaging  data  is  processed,  calibrated, 
and  has  targets  selected.  These  target  chunks  never  overlap,  so  that  once  a  set  of  targets  is  defined  in  a  particular  region  of  sky,  it  never 
changes  in  that  region.  Thus,  the  task  of  determining  the  selection  limits  used  in  any  region  reduces  to  tracking  how  the  target  chunks 
cover  the  sky.  This  list  of  target  chunks,  their  boundaries,  and  their  selection  criteria  is  an  important  product  of  sample  1 1. 

To  drill  spectroscopic  plates  that  have  fibers  on  these  targets,  we  define  “tiling  chunks,”  which  in  principle  can  overlap  more 
than  one  target  chunk.  The  tiling  algorithm  (Blanton  et  al.  2003b)  is  then  run  in  order  to  position  tiles  and  assign  fibers  to  them, 
after  which  plates  are  designed  (that  is,  any  available  fibers  are  assigned  to  various  classes  of  auxiliary  targets,  as  well  as  to  sky  and 
calibration  stars)  and  then  drilled.  In  general,  these  tiling  chunks  will  overlap  because  we  want  the  chance  to  assign  fibers  to  targets 
that  may  have  been  in  adjacent,  earlier  tiling  chunks  but  were  not  assigned  a  fiber.  For  a  target  to  be  covered  by  a  particular  tile,  it 
must  be  in  the  same  tiling  chunk  as  that  tile  and  be  within  the  area  of  the  tile  itself  (because  the  edges  of  tiles  can  extend  beyond  the 
tiling  chunk  boundaries,  a  particular  region  of  sky  can  be  within  the  area  of  a  tile  but  not  “covered”  by  it  as  defined  here).  We  then 
divide  the  survey  into  a  number  of  “sectors,”  regions  that  are  covered  by  a  unique  set  of  tiles  and  tiling  chunks  (following  the 
nomenclature  of  the  2dFGRS,  Percival  et  al.  2001,  the  “sectors”  are  the  same  as  the  “overlap  regions”  defined  in  Blanton  et  al. 
2001).  For  instance,  two  tiles  overlapping  each  other  but  no  other  tiles  give  rise  to  three  sectors:  the  area  covered  only  by  the  first 
tile,  the  area  covered  only  by  the  second  tile,  and  the  area  covered  by  both. 

For  samplell  the  sky  area  is  covered  by  669  circular  tiles  of  diameter  2?98  and  is  split  into  2489  disjoint  sectors.  This 
decomposition  is  convenient  since  each  sector  has  a  unique  sampling  rate.  The  sampling  rate  of  a  sector  is  defined  as  the  number  of 
redshifts  of  galaxy  targets  obtained  in  the  sector  (including  the  galaxies  assigned  the  redshift  of  a  neighbor  because  of  a  fiber 
collision)  divided  by  the  number  of  galaxy  targets  in  the  sector.  The  sampling  rate  so  calculated  is  about  95%  on  average  across  the 
survey  area;  about  95%  of  the  survey  area  has  completeness  greater  than  90%.  This  is  illustrated  in  Figure  1. 

Two  additional  sets  of  geometric  entities  affect  the  angular  selection  function  n(r):  it  vanishes  inside  each  of  55  rectangular  holes 
(regions  masked  out  because  of  bad  data  quality  or  tiling  bugs  from  early  in  the  survey)  and  outside  the  official  survey  region 
defined  by  83  rectangular  bounding  boxes  (the  boundaries  of  the  target  chunks).  In  summary,  the  angular  selection  function  h(r) 
equals  the  sampling  fraction  when  inside  the  survey  area  and  is  zero  otherwise. 

An  additional  complication  when  one  is  evaluating  n(r)  is  that  the  above-mentioned  geometric  entities  are  specified  in  three 
different  coordinate  systems  in  various  combinations:  equatorial  coordinates  (right  ascension,  declination),  SDSS  survey  coordi¬ 
nates  (77,  X),  and  SDSS  great  circle  coordinates  (/i,  v). 
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A2.2.  Spherical  Polygon  Representation 

Fortunately,  we  can  convert  the  specification  of  h(r)  into  an  equivalent  but  much  simpler  form  in  terms  of  spherical  polygons 
that  encodes  all  these  complications.  This  simplification  is  necessary  since  our  power  spectrum  estimation  method  involves  the 
complex  task  of  expanding  h(r)  in  spherical  harmonics. 

All  tile,  hole,  and  bounding  box  boundaries  are  simple  arcs  on  the  celestial  sphere,  corresponding  to  the  intersection  of  the  sphere 
with  some  appropriate  plane.  This  means  that  any  convex  spherical  polygon  (a  tile,  hole,  bounding  box,  sector,  etc.)  can  be  defined  as 
the  intersection  of  a  set  of  “caps,”  where  a  cap  is  the  set  of  directions  r  satisfying  a-r  >  b  for  some  unit  vector  a  and  some  constant 
b  £  [—1, 1].  Think  of  a  cap  as  the  area  defined  by  a  plane  slicing  a  sphere.  For  instance,  a  tile  is  a  single  cap,  and  a  rectangular  hole  is  the 
intersection  of  four  caps.  The  angular  selection  function  h(r)  (plotted  in  Fig.  1)  can  be  clearly  represented  as  a  list  of  nonoverlapping 
polygons  such  that  h(r )  is  constant  in  each  one.  We  construct  the  polygon  list  by  using  the  MANGLE33  software  described  in 
Hamilton  &  Tegmark  (2003),  which  involves  the  following  steps: 

1.  We  generate  a  list  of  807  polygons  comprising  669  tiles,  83  bounding  boxes,  and  55  holes. 

2.  Whenever  two  polygons  intersect,  we  split  them  into  nonintersecting  parts,  thereby  obtaining  a  longer  list  of  8484  nonover¬ 
lapping  polygons.  Although  slightly  tricky  in  practice,  such  an  algorithm  is  easy  to  visualize:  if  you  draw  all  boundary  lines  on  a 
sphere  and  give  it  to  a  child  as  a  coloring  exercise,  using  four  crayons  and  not  allowing  identically  colored  neighbors,  you  would 
soon  be  looking  at  such  a  list  of  nonoverlapping  polygons. 

3.  We  compute  the  completeness  n(r )  for  each  of  these  new  polygons,  by  using  the  scheme  described  in  §  A.2.1. 

4.  We  simplify  the  result  by  omitting  polygons  with  zero  weight  and  merging  adjacent  polygons  that  have  identical  weight. 

The  result  is  a  list  of  2914  polygons  with  a  total  (unweighted)  area  of  2499  deg2  and  an  effective  (weighted)  area  J  h(r)  dfl  of 
2417  deg2.  These  polygons  are  sometimes  sectors,  sometimes  parts  of  sectors.  This  angular  completeness  map  and  the  polygons 
into  which  it  resolves  are  illustrated  in  Figure  1. 

A3.  IMPOSING  A  UNIFORM  MAGNITUDE  LIMIT 

As  mentioned  above,  the  faint  magnitude  limit  varies  in  a  known  way  from  target  chunk  to  target  chunk  and  is  hence  a  known 
constant  in  each  of  our  2914  polygons.  We  construct  a  uniform  galaxy  sample  that  is  complete  down  to  a  limiting  magnitude  r\\m  by 
applying  the  following  two  cuts: 

1.  Reject  all  galaxies  whose  extinction-corrected  magnitude  r  is  fainter  than 

2.  Reject  all  sectors  whose  extinction-corrected  magnitude  limit  is  brighter  than  ri;m. 

Figure  39  shows  the  number  of  surviving  galaxies  as  a  function  of  r\[m.  As  we  increase  7'ijm,  the  first  cut  eliminates  fewer  galaxies 
whereas  the  second  cut  eliminates  more  galaxies.  The  result  is  seen  to  be  a  curve  with  peaks  at  17.50,  17.60,  17.62,  17.67,  and  17.77, 
corresponding  to  the  five  magnitude  limits  used  in  spectroscopic  target  selection  during  the  course  of  the  survey.  To  maximize  our 
sample  size,  we  choose  to  cut  at  the  highest  peak  (?'iim  =  17.50).  This  gives  a  sample  of  157,389  galaxies,  denoted  safeO  in  Table  1. 
Since  the  optimal  cut  of  1 7.50  also  happens  to  be  the  brightest  magnitude  limit  used,  we  need  not  reject  any  sectors  (as  per  cut  2),  so  the 
angular  footprint  of  this  uniform  subsample  has  the  same  area  as  that  of  the  full  sample.  As  the  survey  progresses  further  with  its 
current  magnitude  limit  of  17.77,  this  will  eventually  become  the  limit  that  yields  the  largest  sample. 


33  Available  at  http://www/http://casa. colorado.edu/  ~ajsh/mangle. 
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Fig.  39. — Jagged  curve  shows  number  of  galaxies  surviving  as  a  function  of  uniform  magnitude  cut  and  is  approximately  shaped  as  the  product  of  the  two  other 
curves,  which  corresponds  to  our  two  cuts:  the  rising  curve  shows  the  number  of  galaxies  whose  magnitude  is  brighter  than  rijm,  and  the  falling  staircase-shaped 
curve  shows  the  number  of  galaxies  in  sectors  whose  magnitude  limit  is  fainter  than  njm.  [See  the  electronic  edition  of  the  Journal  for  a  color  version  of  this  figure .] 
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A4.  THE  RADIAL  SELECTION  FUNCTION 

It  is  important  to  estimate  the  radial  selection  function  h(r)  as  accurately  as  possible,  since  errors  in  it  translate  into  spurious 
large-scale  power.  Our  estimate  is  plotted  in  Figure  2  and  is  computed  as  follows. 

The  radial  window  function  of  sample  1 1  is  defined  by  the  galaxy  luminosity  function,  the  flux  limits,  and  the  absolute  magnitude 
limits  of  the  sample  in  question.  As  noted  above,  our  sample  is  limited  at  bright  and  faint  apparent  magnitudes:  14.5  <  r  <  17.5. 
Thus,  at  any  given  redshift  we  can  observe  galaxies  only  in  a  given  absolute  magnitude  range.  When  making  cuts  based  on  absolute 
magnitude,  we  use  the  quantity  01r  described  in  Blanton  (2003a),  which  refers  to  the  r-band  magnitude  X-corrected  to  its  z  =  0.1 
value.  Thus,  evolution-corrected  absolute  magnitudes  M  are  calculated  from  apparent  magnitudes  m  as  follows: 

M  =  m  -  DM  (z)  -  K0,  x  (z)  +  Q(z) .  ( A3 ) 


Here  A'o.i(z)  is  the  galaxy  X-correction,  as  calculated  by  using  the  code  of  Blanton  et  al.  (2003a),  kcorrect  vl_ll.  DM  (z)  = 
5  log  [r(l  +  z)]  +  25  is  the  distance  modulus  (see  Hogg  1999),  where  (1  +  z)r  is  the  luminosity  distance.  Q(z)  accounts  for  the  average 
evolution  in  galaxy  luminosities  in  the  recent  past,  and  we  use  the  fit  Q(z)  =  1.6 (z  —  0.1).  We  will  measure  evolution  in  detail  for 
dilferent  galaxy  types  in  future  papers;  however,  for  the  present  work,  this  simple  fit  for  the  evolution  of  all  galaxies  is  sufficient. 
At  any  redshift,  the  fraction  of  objects  in  this  absolute  magnitude  range  ;m  that  are  in  the  sample  is 


/(z) 


where  $(M)  is  the  luminosity  function  (number  density  of  objects  per  unit  magnitude)  and 

Mbri(z)  =  max  [Mmin,  14.5  -  DM(z)  -  AT0.i  (z)  +  Q(z)], 
Miim(z)  =  min  [Mmax,  17.5  -  DM(z)  -  X0.i  (z)  +  Q(z)] . 


(A4) 


(A5) 


In  this  context,  ATo.i  (z)  is  determined  by  using  the  mean  galaxy  SED  in  the  sample.  Equations  (A4)  and  (A5)  simply  express  the  fact 
that  a  galaxy  must  lie  in  our  apparent  magnitude  range  and  in  our  absolute  magnitude  range  to  be  included  in  the  sample.  The 
luminosity  function  for  our  sample  is  determined  in  the  manner  described  by  Blanton  et  al.  (2001b),  by  using  the  step-wise  maximum 
likelihood  method  of  Efstathiou,  Ellis,  &  Peterson  (1988),  again  by  using  Q(z )  =  1.6(z  —  0.1). 

We  transform  the  galaxy  positions  into  the  Local  Group  frame,  assuming  that  the  solar  motion  relative  to  the  Local  Group  is 
306  km  s_1  toward  /  =  99°,  b  =  —4°  (Courteau  &  van  den  Bergh  1999). 


APPENDIX  B 

POWER  SPECTRUM  ESTIMATION  DETAILS 

In  this  Appendix,  we  describe  our  power  spectrum  estimation  procedure  in  sufficient  detail  for  the  reader  interested  in  repro¬ 
ducing  our  analysis. 


Bl.  PARAMETERIZING  OUR  PROBLEM 

We  parameterize  the  ratio  of  the  three  power  spectra  Pgg{k),  Pgv{k),  and  Pm(k )  to  the  prior  as  piecewise  constant  functions,  each 
with  97  “steps.”  Doing  this  rather  than  taking  the  power  spectrum  itself  to  be  constant  avoids  unnecessarily  jagged  spectra  as 
discussed  in  THX02.  The  resulting  parameters  p,  are  termed  the  “band  powers.”  As  long  as  the  prior  agrees  fairly  well  with  the 
measured  result,  this  has  the  advantage  of  giving  better  behaved  window  functions,  as  described  in  Hamilton  &  Tegmark  (2000). 

We  group  these  3  x  97  numbers  into  a  291 -dimensional  vector  p.  We  choose  our  97  k-bands  to  be  centered  on  logarithmically 
equispaced  A'-values  k,  =  10^  '~65)/16  h  Mpc-1,  i  =  1, .  .  .,  97,  i.e.,  ranging  from  0.0001  to  100  h  Mpc-1.  This  should  provide  fine 
enough  A-resolution  to  resolve  any  spectral  features  that  may  be  present  in  the  power  spectrum.  For  instance,  baryon  wiggles  have  a 
characteristic  scale  on  the  order  of  A k  ~  0.1,  so  we  oversample  the  first  one  around  k  ~  0.1  by  a  factor 
Ak/(k26  —  A'25)  ~  16/  In  10  ~  7. 

This  parameterization  means  that  we  can  write  the  pixel  covariance  matrix  of  equation  (5)  as 

291 

C  =  £>iC,/,  (Bl) 

;=0 

where  the  derivative  matrix  C„  =  DC / dp,  is  the  contribution  to  the  covariance  matrix  from  the  /th  band.  For  notational  conve¬ 
nience,  we  have  included  the  noise  term  in  equation  (Bl)  by  defining  C, 0  =  N,  corresponding  to  an  extra  dummy  parameter po  =  1 
giving  the  shot  noise  normalization. 

B2.  QUADRATIC  ESTIMATOR  BASICS 

Quadratic  estimators  were  originally  derived  for  galaxy  survey  applications  (Hamilton  1997a,  1997b).  They  were  accelerated 
and  first  applied  to  CMB  analysis  (Tegmark  1997b;  Bond  et  al.  2000)  and  have  been  a  cornerstone  of  almost  all  recent  CMB  power 
spectrum  analyses. 
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Our  quadratic  estimators  pi  defined  by  equation  (23)  are  designed  to  measure  the  corresponding  parameters  p,.  We  choose  the 
Q  matrices  to  be  of  the  form 

j  m 

Q,  =  (B2) 

7=1 

where  m  =  291  is  the  number  of  bands  (power  spectrum  parameters).  M  is  an  m  x  m  matrix  that  we  specify  below.  In  the 
approximation  that  the  pixelized  data  have  a  Gaussian  probability  distribution  (a  good  approximation  in  our  case  because  we  are 
mostly  in  the  linear  regime),  the  choice  of  equation  (B2)  has  been  shown  to  be  lossless,  distilling  all  the  power  spectrum 
information  from  the  original  data  set  into  the  quadratic  estimator  vector  p  (Tegmark  1997b).  This  is  true  for  any  choice  of  the 
matrix  M  as  long  as  it  is  invertible:  the  result  from  using  a  different  matrix  M'  could  always  be  computed  afterward  by  multiplying 
the  vector  p  by  M'M~l . 

The  quadratic  estimators  pi  have  the  additional  advantage  (as  compared  with,  e.g.,  maximum  likelihood  estimators)  that  their 
statistical  properties  are  easy  to  compute:  their  mean  and  covariance  are  given  by  equations  (24)  and  (25),  where  the  window 
matrix  W  and  the  covariance  matrix  S  are 

Wy  =  tr  [QiC^],  (B3) 

S,y  =  2tr  [Q/CQjC].  (B4) 

By  substituting  equation  (B2),  this  gives 

W  =  MF ,  (B5) 

S  =  MFM',  (B6) 

where  F  is  the  Fisher  information  matrix  (Fisher  1935;  Tegmark,  Taylor,  &  Heavens  1997) 

Fij  =  ^ti[C-1C,iC-lC,j].  (B7) 

In  conclusion,  the  quadratic  estimator  method  takes  the  vector*  and  its  covariance  matrix  C  from  Figure  10  and  compresses  it  into 
the  shorter  vector  p  in  Figure  14  and  its  covariance  matrix  while  retaining  essentially  all  the  cosmological  information. 

B3.  QUADRATIC  ESTIMATOR  VARIANTS:  CHOOSING  THE  M  MATRIX 

For  the  purpose  of  fitting  models  p  to  our  measurements  p,  we  are  already  finished — equations  (24)  and  (25)  tell  us  how  to  compute 
X2  for  any  given  p ,  and  the  result, 

X2  =  {p-{p))"Z-\p-{p))\  (B8) 

is  independent  of  the  choice  of  M.  However,  since  one  of  the  key  goals  of  our  analysis  is  to  provide  model-independent 
measurement  of  the  three  power  spectra  themselves,  the  choice  of  M  is  crucial.  Ideally,  we  would  like  both  uncorrelated  error  bars 
(diagonal  X)  and  well-behaved  (narrow,  unimodal,  and  nonnegative)  window  functions  W  that  do  not  mix  the  three  power  spectra, 
W  =  /  being  the  ideal. 

There  are  two  separate  issues  of  interest  for  choosing  M.  The  first  involves  the  trade-off  between  making  S  well  behaved  and 
making  W  well  behaved.  The  second  involves  the  complication  that  we  are  measuring  three  power  spectra  rather  than  one  and  that 
quadratic  estimators  tend  to  mix  them,  with  estimators  of  one  spectrum  being  contaminated  by  “leakage”  from  another.  The 
following  two  subsections  discuss  these  two  issues  in  turn.  For  all  choices  below,  we  wish  each  window  function  (row  of  W)  to 
sum  to  unity  so  that  we  can  interpret  /),  as  measuring  a  weighted  average  of  the  true  power.  Because  of  equation  (B5),  the  rows  of 
M  are  therefore  normalized  to  satisfy 

j2(MF)ij  = 1  (fi9) 


for  all  i. 


B3.1.  Correlated,  Anticorrelated,  and  Uncorrelated  Band  Powers 

There  are  a  number  of  interesting  choices  ofM  that  each  have  their  pros  and  cons  (Tegmark  &  Hamilton  1998;  Hamilton  &  Tegmark 
2000).  The  simple  choice  in  whichM  is  diagonal  gives  the  “best  guess”  estimates  in  the  sense  of  having  minimum  variance  (Hamilton 
1997a;  Tegmark  1997a;  Bond  et  al.  2000)  and  also  has  the  advantage  of  being  independent  of  the  number  of  bands  used  in  the  limit  of 
high  spectral  resolution.  It  was  used  for  Figures  13  and  15.  Here  the  window  functions  are  simply  the  rows  of  the  Fisher  matrix  and  are 
seen  to  be  rather  broad.  All  entries  of  F  are  guaranteed  to  be  positive  as  proven  in  PTH0 1 ,  which  means  not  only  that  all  windows  are 
positive  (which  is  good)  but  also  that  all  measurements  are  positively  correlated  (which  is  bad). 

Another  interesting  choice  is  (Tegmark  1 997b)  M  =  F  1 ,  which  gives  W  =  I.  In  other  words,  all  window  functions  are  Kronecker 
delta  functions,  and  p  gives  completely  unbiased  estimates  of  the  band  powers,  with  (pi)  =  />,  regardless  of  what  values  the  other  band 
powers  take.  This  gives  an  estimate p  similar  to  the  maximum  likelihood  method  (Tegmark  et  al.  1 998),  and  the  covariance  matrix  of 
equation  (25)  reduces  to  F  1 .  A  serious  drawback  of  this  choice  is  that  if  we  have  sampled  the  power  spectrum  on  a  scale  finer  than  the 
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inverse  survey  size  in  an  attempt  to  retain  all  information  about  wiggles,  etc.,  this  covariance  matrix  tends  to  give  substantially  larger 
error  bars  (Ap,-  =  M*/2  =  [(F-1)^]1^2)  than  the  first  method,  anticorrelated  between  neighboring  bands. 

The  two  above-mentioned  choices  for  M  both  tend  to  produce  correlations  between  the  band  power  error  bars.  The  minimum- 
variance  choice  generally  gives  positive  correlations,  since  the  Fisher  matrix  cannot  have  negative  elements,  whereas  the  unbiased 
choice  tends  to  give  anticorrelation  between  neighboring  bands.  The  choice  (Tegmark  &  Hamilton  1998;  Hamilton  &  Tegmark  2000) 
M  =  F  1  2  with  the  rows  renormalized  has  the  attractive  property  of  making  the  errors  uncorrelated,  with  the  covariance  matrix  of 
equation  (25)  diagonal.  The  corresponding  window  functions  W  are  plotted  in  Figure  16  and  are  seen  to  be  quite  well  behaved,  even 
narrower  than  those  in  Figure  15  while  remaining  positive  in  almost  all  cases.34  This  choice,  which  is  the  one  we  make  in  this  paper, 
is  a  compromise  between  the  first  two:  it  narrows  the  minimum-variance  window  functions  at  the  cost  of  only  a  small  noise  increase, 
with  uncorrelated  noise  as  an  extra  bonus.  The  minimum-variance  band  power  estimators  are  essentially  a  smoothed  version  of 
the  uncorrelated  ones,  and  their  lower  variance  was  paid  for  by  correlations  that  reduced  the  effective  number  of  independent 
measurements. 


B3.2.  Disentangling  the  Three  Power  Spectra 

The  fact  that  we  are  measuring  three  power  spectra  rather  than  one  introduces  an  additional  complication.  As  illustrated  by  Figure  18, 
an  estimate  of  the  power  in  one  of  the  three  spectra  generally  picks  up  unwanted  contributions  (“leakage”)  from  the  other  two,  making 
it  complicated  to  interpret.  Although  the  above-mentioned  F_1  method  in  principle  eliminates  leakage  completely  (since  it  gives 
W  =  /),  the  cost  in  terms  of  increased  error  bars  is  found  to  be  prohibitive.  We  therefore  follow  HTP00  and  THX02  in  adopting  the 
following  procedure  for  disentangling  the  three  power  spectra:  For  each  of  the  97  A:-bands,  we  take  linear  combinations  of  the  gg,  gv,  and 
vv  measurements  such  that  the  unwanted  parts  of  the  window  functions  average  to  zero.  This  procedure  is  mathematically  identical  to 
that  used  in  Tegmark  &  de  Oliveira-Costa  (2001)  for  separating  different  types  of  CMB  polarization,  so  the  interested  reader  is  referred 
there  for  the  explicit  equations.  For  the  reader  familiar  with  Bayesian  statistics,  our  disentanglement  procedure  is  tantamount  to 
marginalizing  over  the  amplitudes  of  the  other  two  power  spectra,  separately  for  each  band. 

The  procedure  is  illustrated  in  Figure  18  and  by  construction  has  the  property  that  leakage  is  completely  eliminated  if  the  true 
power  has  the  same  shape  (not  necessarily  the  same  amplitude)  as  the  prior.  We  find  that  this  method  works  quite  well  (in  the  sense 
that  the  unwanted  windows  do  not  merely  average  to  zero)  for  the  most  accurately  measured  band  powers,  in  particular  the  central 
parts  of  the  gg  spectrum,  with  slightly  poorer  leakage  elimination  for  bands  with  larger  error  bars. 

The  window  functions  plotted  in  Figure  24  are  the  gg  windows  after  disentanglement.  Note  that  although  our  disentanglement 
introduces  correlations  between  the  gg,  gv,  and  vv  measurements  for  a  given  A'-band,  different  A'-bands  remain  uncorrelated. 

B4.  NUMERICAL  ISSUES 

Our  analysis  pipeline  has  a  few  “knobs”  that  can  be  set  in  more  than  one  way.  This  section  discusses  the  sensitivity  to  such 
settings. 

B4.1.  Prior 

The  analysis  method  employed  assumes  a  “prior”  power  spectrum  via  equation  (Bl),  both  to  compute  band  power  error  bars 
and  to  find  the  galaxy  pair  weighting  that  minimizes  them.  An  iterative  approach  was  adopted  by  starting  with  a  simple  BBKS 
model  for  Pgg( k)  (Bardeen  et  al.  1986),  then  shifting  it  vertically  and  horizontally  to  better  fit  the  resulting  measurements  and 
recomputing  the  measurements  a  second  time.  As  priors  for  Pgv(k)  and  P,w(k)  we  use  equations  (21)  and  (22)  with  r  =  1  and 
/?  =  0.5,  which  provides  a  good  fit  to  the  measurements. 

To  what  extent  does  this  choice  of  prior  affect  the  results?  On  purely  theoretical  grounds  (e.g.,  Tegmark  et  al.  1997),  one  expects 
a  grossly  incorrect  prior  to  give  unbiased  results  but  with  unnecessarily  large  variance.  If  the  prior  is  too  high,  the  sample-variance 
contribution  to  error  bars  will  be  overestimated  and  vice  versa.  This  hypothesis  has  been  extensively  tested  and  confirmed  in  the 
context  of  power  spectrum  measurements  from  both  the  CMB  (e.g.,  Bunn  1995)  and  galaxy  redshift  surveys  (PTH01),  confirming 
that  the  correct  result  is  recovered  on  average  even  when  a  grossly  incorrect  prior  is  used.  In  our  case,  the  prior  by  construction 
agrees  quite  well  with  the  actual  measurements  (see  Fig.  14),  so  the  quoted  error  bars  should  be  reliable  as  well. 

B4.2.  Effect  of  Changing  the  Number  of  PKL  Modes 

We  have  limited  our  analysis  to  the  first  N  =  4000  PKL  modes  whose  angular  part  is  spanned  by  spherical  harmonics  with  £  <  40. 
This  choice  was  a  trade-off  between  the  desire  to  capture  as  much  information  as  possible  about  the  galaxy  survey  and  the  need  to  stay 
away  from  small  scales  on  which  nonlinear  effects  invalidate  the  Kaiser  approximation  to  redshift  distortions.  To  quantify  our 
sensitivity  to  these  choices,  we  repeated  the  entire  analysis  using  the  first  500,  1000,  2000,  and  4000  modes.  Our  power  spectrum 
measurements  on  the  very  largest  scales  were  recovered  even  with  merely  500  modes.  As  we  added  more  and  more  modes  (more  and 
more  small-scale  information),  the  power  measurements  converged  to  those  in  Figure  14  for  larger  and  larger  k.  Figure  12  shows  that 
our  4000  PKL  modes  are  all  rather  insensitive  to  cosmological  information  for  k  0.2. 

B4.3.  Convergence  Issues 

A  key  step  in  our  analysis  pipeline  is  the  computation  of  the  matrices  Pi  =  OTi/dpt  that  give  the  contribution  to  the  signal 
covariance  matrix  section  from  the  z'th  band  power.  This  computation  involves  a  summation  over  multipoles  I  that  should,  strictly 
speaking,  run  from  I  =  0  to  oo,  since  the  angular  completeness  map  itself  has  sharp  edges  involving  harmonics  to  I  =  oo.  In 


34  The  reader  interested  in  mathematical  challenges  will  be  interested  to  know  that  it  remains  a  mystery  to  the  authors  why  this  /'’  1/2  method  works  so  well.  We 
have  been  unable  to  prove  that  the  resulting  window  matrix  /’’ 1  /2  has  no  negative  elements  (indeed,  counterexamples  can  be  constructed),  yet  the  method  works  like 
magic  in  practice  in  all  LSS  and  CMB  applications  we  have  tried. 
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Fig.  40. — Numerical  convergence.  The  figure  shows  for  how  many  of  our  4000  PKL  modes  the  numerical  calculations  are  converged  to  accurately  measure  the 
power  up  to  a  given  wavenumber  k.  Left  to  right:  The  12  curves  correspond  to  truncation  at  £cut  =  20,  40,  60,  80,  100,  120,  140,  160,  180,  200,  220,  and  240. 


practice,  this  summation  must  of  course  be  truncated  at  some  finite  multipole  £cai-  To  quantify  the  effect  of  this  truncation,  we 
plot  the  diagonal  elements  of  the  P  matrices  as  a  function  of  £cut  and  study  how  they  converge  as  £cut  increases.  We  define  a 
given  PKL  mode  as  having  converged  by  some  multipole  if  subsequent  ^-values  contribute  less  than  1%  of  its  variance.  Figure  40 
plots  the  number  of  usable  PKL  modes  as  a  function  of  wavenumber  k ,  defining  a  mode  to  be  usable  for  our  analysis  only  if  it  is 
converged  for  all  smaller  wavenumbers  k'  <  k  for  all  three  power  flavors  ( Pgg ,  Pgv,  and  Pm).  We  use  £cut  =  260  in  our  final 
analysis,  since  this  guarantees  that  all  4000  modes  are  usable  for  wavenumbers  k  in  the  range  0-0.7  h  Mpc~\  i.e.,  comfortably 
beyond  the  large  scales  0-0.3  h  Mpc-1  that  are  the  focus  of  this  paper.  With  this  cutoff,  the  computation  of  the  P  matrices 
(which  scales  as  £^ut  asymptotically),  took  about  a  week  on  a  2  GHz  LINUX  workstation.  As  a  further  test,  we  repeated  our 
entire  analysis  with  £cut  =120  and  obtained  almost  indistinguishable  power  spectra. 

REFERENCES 


Abazajian,  K.,  et  al.  2003,  AJ,  126,  2081 

Bardeen,  J.  M.,  Bond,  J.  R.,  Kaiser,  N.,  &  Szalay,  A.  S.  1986,  ApJ,  304,  15 
Bennett,  C.  L„  et  al.  2003,  ApJS,  148,  1 
Blanton,  M.  R.  2000,  ApJ,  544,  63 

Blanton,  M.  R.,  Brinkmann,  J.,  Csabai,  I.,  Doi,  M.,  Eisenstein,  D.  J., 
Fukugita,  M.,  Gunn,  J.  E„  Hogg,  D.  W.,  &  Schlegel,  D.  J.  2003a,  AJ,  125, 2348 
Blanton,  M.  R.,  Cen,  R.,  Ostriker,  J.  R,  &  Strauss,  M.  A.  1999,  ApJ,  522,  590 
Blanton,  M.  R„  Cen,  R„  Ostriker,  J.  R,  Strauss,  M.  A.,  &  Tegmark,  M.  2000, 
ApJ,  531,  1 

Blanton,  M.  R„  Lupton,  R.  H.,  Maley,  F.  M„  Young,  N.,  Zehavi,  I.,  &  Loveday, 
J.  2003b,  AJ,  125,  2276 

- .  2001,  AJ,  121,  2358 

- .  2003c,  ApJ,  592,  819 

- .  2003d,  ApJ,  594,  186 

Bond,  J.  R.,  Jaffe,  A.  H„  &  Knox,  L.  E.  2000,  ApJ,  533,  19 
Bridle,  S.,  Lahav,  O.,  Ostriker,  J.  P.,  &  Steinhardt,  P.  J.  2003,  Science, 
299,  1532 

Bryan,  G.,  &  Norman,  M.  1998,  ApJ,  495,  80 
Budavari,  T.,  et  al.  2003,  ApJ,  595,  59 
Bunn,  E.  F.  1995,  Ph.D.  thesis,  Univ.  California,  Berkeley 
Cen,  R„  &  Ostriker,  J.  P.  2000,  ApJ,  538,  83 
Cole,  S.,  Fisher,  K.  B„  &  Weinberg,  D.  H.  1994,  MNRAS,  267,  785 
Coles,  P.  1993,  MNRAS,  262,  1065 
Coles,  P.,  Melott,  A.,  &  Munshi,  D.  1999,  ApJ,  521,  L5 
Colless,  M„  et  al.  2001,  MNRAS,  328,  1039 
Connolly,  A.,  et  al.  2002,  ApJ,  579,  42 
Courteau,  S.,  &  van  den  Bergh,  S.  1999,  AJ,  118,  337 
Croft,  R.  A.  C.,  Weinberg,  D.  H„  Pettini,  M„  Hemquist,  L.,  &  Katz,  N.  1999, 
ApJ,  520,  1 

Davis,  M.,  &  Geller,  M.  J.  1976,  ApJ,  208,  13 

Davis,  M„  Meiksin,  A.,  Strauss,  M.  A.,  da  Costa,  L.  N„  &  Yahil,  A.  1988, 
ApJ,  333,  L9 

Davis,  M.,  Newman,  J.  A.,  Faber,  S.  M.,  &  Phillips,  A.  C.  2001,  in  Deep  Fields, 
ed.  S.  Cristiani,  A.  Renzini,  &  R.  E.  Williams  (Dordrecht:  Springer),  241 
Dekel,  A.,  &  Lahav,  O.  1999,  ApJ,  520,  24 
Dodelson,  S.,  et  al.  2002,  ApJ,  572,  140 
Dressier,  A.  1980,  ApJ,  236,  351 

Durrer,  R.,  Gabrielli,  A.,  Joyce,  M.,  &  Labini,  F.  S.  2003,  ApJ,  585,  LI 
Efstathiou,  G.,  &  Bond,  J.  R.  1999,  MNRAS,  304,  75 


Efstathiou,  G.,  Ellis,  R.  S.,  &  Peterson,  B.  A.  1988,  MNRAS,  232,  431 
Efstathiou,  G„  &  Moody,  S.  J.  2001,  MNRAS,  325,  1603 
Eisenstein,  D.  J.,  &  Hu,  W.  1999,  ApJ,  511,  5 
Eisenstein,  D.  J.,  Hu,  W.,  &  Tegmark,  M.  1998,  ApJ,  504,  L57 

- .  1999,  ApJ,  518,  2 

Eisenstein,  D.  J.,  et  al.  2001,  AJ,  122,  2267 
Evrard,  A.  E.,  et  al.  2002,  ApJ,  573,  7 
Falco,  E.  E.,  et  al.  1999,  PASP,  111,  438 

Feldman,  H.  A.,  Kaiser,  N.,  &  Peacock,  J.  A.  1994,  ApJ,  426,  23 
Fisher,  K.  B„  Davis,  M.,  Strauss,  M.  A.,  Yahil,  A.,  &  Huchra,  J.  P.  1993, 
ApJ,  402.  42 

Fisher,  K.  B.,  Scharf,  C.  A.,  &  Lahav,  O.  1994,  MNRAS,  266,  219 
Fisher,  R.  A.  1935,  J.  R.  Statistical  Soc.,  98,  39 
Frenk,  C.  S.,  et  al.  2000,  preprint  (astro-ph/0007362) 

Fry,  J.  N.  1996,  ApJ,  461,  L65 

Fry,  J.  N.,  &  Gaztanaga,  E.  1993,  ApJ,  413,  447 

Fukugita,  M.,  Ichikawa,  T.,  Gunn,  J.  E.,  Doi,  M.,  Shimasaku,  K.,  &  Schneider, 
D.  P.  1996,  AJ,  111,  1748 
Giavalisco,  M.,  et  al.  1998,  ApJ,  503,  543 
Gnedin,  N.  Y„  &  Hamilton,  A.  J.  S.  2002,  MNRAS,  334,  107 
Goldberg,  D.  M.,  &  Strauss,  M.  A.  1998,  ApJ,  495,  29 
Gunn,  J.  E„  et  al.  1998,  AJ,  116,  3040 
Hamilton,  A.  J.  S.  1988,  ApJ,  331,  L59 

- .  1997a,  MNRAS,  289,  285 

- .  1997b,  MNRAS,  289,  295 

- .  1998,  in  The  Evolving  Universe,  ed.  D.  Hamilton  (Dordrecht: 

Kluwer),  185 

- .  2001,  MNRAS,  322,  419 

Hamilton,  A.  J.  S„  &  Tegmark,  M.  2000,  MNRAS,  312,  285 

- .  2002,  MNRAS,  330,  506 

- .  2003,  preprint  (astro-ph/0306324) 

Hamilton,  A.  J.  S.,  Tegmark,  M„  &  Padmanabhan,  N.  2000,  MNRAS,  317,  L23 
(HTP00) 

Hannestad,  S.  2003,  J.  Cosmol.  Astropart.  Phys.,  5,  4 
Hatton,  S.,  &  Cole,  S.  1998,  MNRAS,  296,  10 
Hawkins,  E„  et  al.  2003,  MNRAS,  346,  78 

Heavens,  A.  F„  Matarrese,  S.,  &  Verde,  L.  1998,  MNRAS,  301,  797 
Heavens,  A.  F.,  &  Taylor,  A.  N.  1995,  MNRAS,  275,  483 
Hinshaw,  G.,  et  al.  2003,  ApJS,  148,  135 


740 


TEGMARK  ET  AL. 


Hoekstra,  H„  Yee,  H„  &  Gladders,  M.  2002,  ApJ,  577,  595 
Hogg,  D.  W.  1999,  preprint  (astro-ph/9905 1 1 6) 

Hogg,  D.  W.,  Finkbeiner,  D.  P.,  Schlegel,  D.  J„  &  Gunn,  J.  E.  2001,  AJ, 
122,  2129 

Hu,  W.,  Eisenstein,  D.  J.,  &  Tegmark,  M.  1998,  Phys.  Rev.  Lett.,  80,  24 
Huchra,  J.  R,  Geller,  M.  J.,  de  Lapparent,  V.,  &  Corwin,  H.  G.,  Jr.  1990,  ApJS, 
72,  433 

Kaiser,  N.  1987,  MNRAS,  227,  1 
Lahav,  0.,  et  al.  2002,  MNRAS,  333,  961 

Le  Fevre,  O.,  et  al.  2001,  in  Deep  Fields,  ed.  S.  Cristiani,  A.  Renzini,  & 
R.  E.  Williams  (Dordrecht:  Springer),  236 
Linde,  A.  1994,  Sci.  Am.,  271,  32 

Lineweaver,  C.  H.,  Tenorio,  L.,  Smoot,  G.  F.,  Keegstra,  P.,  Banday,  A.  J.,  & 
Lubin,  P.  1996,  ApJ,  470,  38 

Lupton,  R.  H.,  Gunn,  J.  E.,  Ivezic,  Z.,  Knapp,  G.  R.,  Kent,  S.,  &  Yasuda,  N. 
2001,  in  ASP  Conf.  Ser.  238,  Astronomical  Data  Analysis  Software  and 
Systems,  ed.  F.  R.  Hamden  Jr.,  F.  A.  Primini,  &  H.  E.  Payne  (San  Francisco: 
ASP),  269 

Madgwick,  D„  et  al.  2002,  MNRAS,  333,  133 

Madgwick,  D.  S.,  et  al.  2003,  MNRAS,  344,  847 

Mann,  R.  G„  Peacock,  J.  A.,  &  Heavens,  A.  F.  1998,  MNRAS,  293,  209 

Miller,  C.  J.,  &  Batuski,  D.  J.  2001,  ApJ,  551,  635 

Miller,  C.  J.,  Nichol,  R.  C„  &  Batuski,  D.  J.  2001a,  ApJ,  555,  68 

- .  2001b,  Science,  292,  2302 

Miller,  C.  J.,  Nichol,  R.  C„  &  Chen,  X.  2002,  ApJ,  579,  483 
Narayanan,  V.  K.,  Berlind,  A.  A.,  &  Weinberg,  D.  H.  2000,  ApJ,  528,  1 
Norberg,  P„  et  al.  2001,  MNRAS,  328,  64 
- .  2002,  MNRAS,  336,  907 

Padmanabhan,  N.,  Tegmark,  M.,  &  Hamilton,  A.  J.  S.  2001,  ApJ,  550,  52 
Peacock,  J.  A.  2003,  in  AIP  Conf.  Proc.  666,  The  Emergence  of  Cosmic 
Structure  (New  York:  AIP),  275 
Peacock,  J.  A.,  &  Dodds,  S.  J.  1994,  MNRAS,  267,  1020 
Peacock,  J.  A.,  &  Nicholson,  D.  1991,  MNRAS,  253,  307 
Peacock,  J.  A.,  et  al.  2001,  Nature,  410,  169 
Pen,  U.  1998,  ApJ,  504,  601 

Percival,  W.  J.,  Verde,  L„  &  Peacock,  J.  A.  2004,  MNRAS,  347,  645 
Percival,  W.  J.,  et  al.  2001,  MNRAS,  327,  1297 

- .  2002,  MNRAS,  337,  1068 

Petrosian,  V.  1976,  ApJ,  209,  LI 
Pier,  J.  R.,  et  al.  2003,  AJ,  125,  1559 
Pope,  A.,  et  al.  2004,  ApJ,  in  press 
Richards,  G.  T„  et  al.  2002,  AJ,  123,  2945 


Saunders,  W.,  et  al.  2000,  MNRAS,  317,  55 
Scherrer,  R.  J.,  &  Weinberg,  D.  H.  1998,  ApJ,  504,  607 
Schlegel,  D.  J.,  Finkbeiner,  D.  P.,  &  Davis,  M.  1998,  ApJ,  500,  525 
Scoccimarro,  R.,  Feldman,  H.  A.,  Fry,  J.  N.,  &  Frieman,  J.  A.  2001,  ApJ, 
546,  652 

Scoccimarro,  R„  &  Sheth,  R.  K.  2002,  MNRAS,  329,  629 

Scranton,  R.,  et  al.  2002,  ApJ,  579.  48 

Seljak,  U.  2001,  MNRAS,  325,  1359 

Shectman,  S.  A.,  et  al.  1996,  ApJ,  470,  172 

Smith,  J.  A.,  et  al.  2002,  AJ,  123,  2121 

Smith,  R.  E„  et  al.  2003,  MNRAS,  341,  1311 

Somerville,  R.  S.,  Lemson,  G.,  Sigad,  Y.,  Dekel,  A.,  Kauffmann,  G.,  &  White, 
S.  D.  M.  2001,  MNRAS,  320,  289 
Spergel,  D.  N„  et  al.  2003,  ApJS,  148,  175 
Stoughton,  C.,  et  al.  2002,  AJ,  123,  485 
Strauss,  M.  A.,  et  al.  2002,  AJ,  124,  1810 
Szalay,  A.,  et  al.  2003,  ApJ,  591,  1 

Tadros,  H.,  Efstathiou,  G„  &  Dalton,  G.  1998,  MNRAS,  296,  995 
Tegmark,  M.  1995,  ApJ,  455,  429 

.  1997a,  Phys.  Rev.  Lett.,  79,  3806 
.  1997b,  Phys.  Rev.  D,  55,  5895 
Tegmark,  M.,  &  Bromley,  B.  C.  1999,  ApJ,  518,  L69 
Tegmark,  M.,  &  de  Oliveira-Costa,  A.  2001,  Phys.  Rev.  D,  64,  063001 
Tegmark,  M.,  &  Hamilton,  A.  J.  S.  1998,  in  Relativistic  Astrophysics  and 
Cosmology,  ed.  A.  V.  Olinto,  J.  A.  Frieman,  &  D.  Schramm  (Singapore: 
World  Scientific),  270 

Tegmark,  M.,  Hamilton,  A.  J.  S.,  Strauss,  M.  A.,  Vogeley,  M.  S.,  &  Szalay, 
A.  S.  1998,  ApJ,  499,  555 

Tegmark,  M„  Hamilton,  A.  J.  S.,  &  Xu,  Y.  2002a,  MNRAS,  335,  887  (THX02) 
Tegmark,  M.,  &  Peebles,  P.  J.  E.  1998,  ApJ,  500,  L79 
Tegmark,  M.,  Taylor,  A.  N.,  &  Heavens,  A.  F.  1997,  ApJ,  480,  22 
Tegmark,  M.,  &  Zaldarriaga,  M.  2002,  Phys.  Rev.  D,  66,  103508 
Tegmark,  M.,  et  al.  2002b,  ApJ,  571,  191 

- .  2003,  Phys.  Rev.  D,  in  press  (astro-ph/03 10723;  Paper  II) 

Verde,  L„  et  al.  2003,  ApJS,  148,  195 

Wang,  X.,  Tegmark,  M.,  Jain,  B.,  &  Zaldarriaga,  M.  2003,  Phys.  Rev.  D, 
68,  123001 

Wang,  Y.,  Spergel,  D.  N„  &  Strauss,  M.  A.  1999,  ApJ,  510,  20 
York,  D„  et  al.  2000,  AJ,  120,  1579 
Zehavi,  I.,  et  al.  2002,  ApJ,  571,  172 
- .  2003,  ApJ,  submitted  (astro-ph/0301280) 


